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NUCLEIC ACIDS AND PROTEINS 
RELATED TO ALZHEIMER'S DISEASE, 
AND USES THEREFOR 

5 

Field of the Invention 

The present invention relates generally to the field of neurological and 
physiological dysfunctions associated with Alzheimer's Disease. More particularly, 
10 the invention is concerned with the identification, isolation and cloning of genes 
which are associated with Alzheimer's Disease, as well as their coiresponding 
transcripts and protein products. The present invention also relates to methods for 
detecting and diagnosing carriers of normal and mutant alleles of these genes, to 
methods for detecting and diagnosing Alzheimer's Disease, to methods of identifying 
other genes and proteins related to, or interacting with, the genes and proteins of the 
invention, to methods of screening for potential therapeutics for Alzheimer's Disease, 
to methods of treatment for Alzheimer's Disease, and to cell lines and animal models 
useful in screening for and evaluating potentially useful therapies for Alzheimer's 
Disease. 

Backgroun d of the Invention 
Alzheimer's Disease (AD) is a degenerative disorder of the human central 
nervous system characterized by progressive memory impairment and cognitive and 
intellectual decline during mid to late adult life (Katzman, 1986). The disease is 
accompanied by a constellation of neuro-pathologic features principal amongst which 
are the presence of extracellular amyloid or senile plaques, and neurofibrillary tangles 
in neurons. The etiology of this disease is complex, although in some families it 
appears to be inherited as an autosomal dominant trait. Linkage studies have 
identified three genes associated with the development of AD: p-amyloid precursor 
protein (APP) (Chartier-Harlin et al., 1991; Goate et al., 1991; Murrell et al., 1991; 
Karlinskyetal., 1992; Mullan et al., 1992), presenilin-1 (PS- 1) (Sherrington', 1995), 
and presenilin-2 (PS-2) (Rogaev, 1 995, and Levy-Lahad, 1 995). 

The presenilis are multi-spanning membrane proteins which were described 
in substantial detail in PCT Publication WO96/34099, the entire disclosure of which 
is incorporated herein by reference. Although the functions of the presenilis are 
unknown, a number of autosomal dominant presenilin mutations have been identified 
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which are strongly associated with the development of early-onset, aggressive, 
Familial Alzheimer's Disease (FAD). 

The present disclosure describes the identification, isolation, sequencing and 
characterization of several human genes which interact with the presenilins, mutations 
5 in which may lead to FAD. These presenilin-interacting protein genes may be 
involved in the pathways which, when affected by mutant presenilins, lead to the 
development of Alzheimer's Disease. In addition, mutations in the presenilin- 
interacting protein genes, even in the absence of defects in the presenilins, may be 
causative of Alzheimer's Disease. 



sequencing and characterization of several human genes, referred to herein as 
"presenilin-interacting protein genes" or "PS-interacting protein genes." The products 
of these genes are believed to interact in vivo with the human presenilin-1 proteins 
and, therefore, are implicated in the biochemical pathways which are affected in 
Alzheimer's Disease. Each of these genes, therefore, presents a new therapeutic target 
for the treatment of Alzheimer's Disease. In addition, PS-interacting protein nucleic 
acids, PS-interacting proteins and peptides, antibodies to the PS-interacting proteins, 
cells transformed with PS-interacting protein nucleic acids, and transgenic animals 
altered with PS-interacting protein nucleic acids, all possess various utilities, as 
described herein, for the diagnosis, therapy and continued investigation of Alzheimer's 
Disease and related disorders. 



/Thus, it is one object of the invention to provide isolated nucleic acids 
encoding at least a PS -interacting domain of a PS-interacting protein. These PS- 
interacting proteins include mammalian S5a subunits of the 26S proteasom^f the 



GT24 protein, the p0071 protein, the Rabll protein, the retinoid X receptor-0, the 
cytoplasmic chaperonin, and several sequences identified herein as clones Y2H35, 
Y2H171, and Y2H41 . Preferred nucleotide and amino acid sequences are provided 
herein. It is another object of the invention to provide probes and primers for these 



Summary of the Invention 
The present invention is based, in part, upon the identification, isolation, 
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PS-interacting protein genes, and to provide nucleic acids which encode small 
antigenic determinants of these genes. Therefore, preferred embodiments include 
sequences of at least 10, 15 or 20 consecutive nucleotides selected from the disclosed 
sequences. 

5 Using the nucleic acid sequences and antibodies disclosed and enabled 

herein, methods for identifying allelic variants or heterospecific homologues of a 
human PS-interacting protein and gene are provided. The methods may be practiced 
using nucleic acid hybridization or amplification techniques, immunochemical 
techniques, or any other technique known in the art. The allelic variants may include 

10 other normal human alleles as well as mutant alleles of the PS-interacting protein 
genes which may be causative of Alzheimer's Disease. The heterospecific 
homologues may be from other mammalian species, such as mice, rats, dogs, cats or 
non-human primates, or may be from invertebrate species, such as Drosoohila or 
ejegans. Thus, it is another object of the invention to provide nucleic acids which 

15 encode allelic or heterospecific variants of the disclosed sequences, as well as the 
allelic or heterospecific proteins encoded by them. 

The it another object of the invention to provide vectors, and particularly 
expression vectors, which include any of the above-described nucleic acids. It is a 
further object of the invention to provide vectors in which PS-interacting protein 
20 nucleic acid sequences are operably joined to exogenous regulatory regions to produce 
altered patterns of expression, or to exogenous coding regions to produce fusion 
proteins. Conversely, it is another object to provide nucleic acids in which PS- 
interacting protein regulatory regions are operably joined to exogenous coding 
regions, including standard marker genes, to produce constructs in which the 
regulation of PS-interacting protein genes may be studied and used in assays for 
therapeutics. 

It is another object of the invention to provide host cells and transgenic 
animals which have been transformed with any of the above-described nucleic acids 



25 
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of the invention. The host cells may be prokaryotic or eukaryotic cells and, in 
particular, may be gametes, zygotes, fetal cells, or stem cells useful in producing 
transgenic animal models. 

In particularly preferred embodiments, the present invention provides a 
5 non-human animal model for Alzheimer's Disease, in which the genome of the 
animal, or an ancestor thereof, has been modified by at least one recombinant 
construct which has introduced one of the following modifications: (1 ) insertion of 
nucleotide sequences encoding at least a functional domain of a heterospecific normal 
PS-interacting protein, (2) insertion of nucleotide sequences encoding at least a 
10 functional domain of a heterospecific mutant PS-interacting protein, (3) insertion of 
nucleotide sequences encoding at least a functional domain of a conspecific 
homologue of a heterospecific mutant PS-interacting protein, and (4) inactivation of 
an endogenous PS-interacting protein gene. Preferred transgenic animal models are 
rats, mice, hamsters, guinea pigs, rabbits, dogs, cats, goats, sheep, pigs, and non- 
15 human primates, but invertebrates are also contemplated for certain utilities. 

It is another object of the invention to provide methods for producing at 
least a functional domain of a PS-interacting protein using the nucleic acids of the 
invention. In addition, the present invention also provides substantially pure 
preparations of such proteins, including short peptide sequences for used as 
20 immunogens. Thus, the invention provides peptides comprising at least 10 or 15 

consecutive amino acid residues from the disclosed and otherwise enabled sequences. 
The invention further provides substantially pure preparations of peptides which 
comprise at least a PS-interacting domain of a PS-interacting protein, as well as 
substantially pure preparations of the entire proteins. 

25 Using the substantially pure peptides and proteins enabled herein, the 

invention also provides methods for producing antibodies which selectively bind to a 
PS -interacting protein, as well as cell lines which produce these antibodies. 



SUBSTITUTE SHEET (RULE 26) 



WO 97/27296 

PCT/CA97/0005I 



-5- 



15 



20 



25 



Another object of the present invention is to provide methods of 
identifying compounds which may have utility in the treatment of Alzheimer's 
Disease and related disorders. These methods include methods for identifying 
compounds which can modulate the expression of a PS-interacting protein gene, 
i methods for identifying compounds which can selectively bind to a PS-interacting 
protein, and methods of identifying compounds which can modulate activity of a PS- 
interacting protein. These methods may be conducted in vitro or in vivo, and may 
employ the transformed cell lines and transgenic animal models of the invention. The 
methods also may be part of a clinical trial in which compounds identified by the 
methods of the invention are further tested in human subjects. 

It is another object of the invention to provide methods of diagnosing or 
screening for inherited forms of Alzheimer's Disease by determining if a subject bears 
a mutant PS-interacting protein gene. Mutant PS-interacting genes may be detected 
by assays including direct nucleotide sequencing, probe specific hybridization, 
restriction enzyme digest and mapping, PCR mapping, ligase-mediated PGR 
detection, RNase protection, electrophoretic mobility shift detection, or chemical 
mismatch cleavage. Alternatively, mutant forms of a PS-interacting protein may be 
detected by assays including immunoassays, protease assays, or electrophoretic 
mobility assays. 

It is also an object of the invention to provide pharmaceutical preparations 
which may be used in the treatment of Alzheimer's Disease 
which result from aberration in biochemical pathways involving the PS-interacting 
prmeinsdisclosed and enabled herein. Thus, the present invention alsopro^T^ 
pharmaceutical preparations comprising a substantially pure PS-interacting protein an 
expression vector operably encoding a PS -interacting protein, an expression vector 
operably encoding a PS-interacting protein antisense sequence, an antibody which 
selectively binds to a mutant PS-interacting protein, or an antigenic determinant of a 
mutant PS-interacting protein. These pharmaceutical preparations may be used to 
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treat a patient bearing a mutant PS-interacting protein gene which is causative of 
Alzheimer's Disease or related disorders. 

These an other objects of the present invention are described more fully in 
the following specification and appended claims. 

5 Detailed Description of the Invention 

I. Definitions 

In order to facilitate review of the various embodiments of the invention, 
and an understanding of the various elements and constituents used in making and 
using the invention, the following definitions are provided for particular terms used in 

10 the description and appended claims: 

Presenilin. As used without further modification herein, the terms 
"presenilin" or "presenilins" mean the presenilin- 1 (PS1) and/or the presenilin-2 (PS2) 
genes/proteins. In particular, the unmodified terms "presenilin" or ,4 presenilins" refer 
to the mammalian PS1 and/or PS2 genes/proteins and, preferably, the human PS1 

15 and/or PS2 genes/proteins as described and disclosed in PCT Publication 
WO96/34099. 

Normal. As used herein with respect to genes, the term "normal" refers to 
a gene which encodes a normal protein. As used herein with respect to proteins, the 
term "normal" means a protein which performs its usual or normal physiological role 

20 and which is not associated with, or causative of, a pathogenic condition or state. 

Therefore, as used herein, the term "normal" is essentially synonymous with the usual 
meaning of the phrase "wild type." For any given gene, or corresponding protein, a 
multiplicity of normal allelic variants may exist, none of which is associated with the 
development of a pathogenic condition or state. Such normal allelic variants include, 

25 but are not limited to, variants in which one or more nucleotide substitutions do not 
result in a change in the encoded amino acid sequence. 

Mutant. As used herein with respect to genes, the term "mutant" refers to 
a gene which encodes a mutant protein. As used herein with respect to proteins, the 
term "mutant" means a protein which does not perform its usual or normal 
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physiological role and which is associated with, or causative of, a pathogenic 
condition or state. Therefore, as used herein, the term "mutant" is essentially 
synonymous with the terms "dysfunctional," "pathogenic," "disease-causing," and 
"deleterious." With respect to the presenilin and presenilin-interacting protein genes 
and proteins of the present invention, the term "mutant" refers to genes/proteins 
bearing one or more nucleotide/amino acid substitutions, insertions and/or deletions 
which typically lead to the development of the symptoms of Alzheimer's Disease 
and/or other relevant inheritable phenotypes (e.g. cerebral hemorrhage, mental 
retardation, schizophrenia, psychosis, and depression) when expressed in humans. 
This definition is understood to include the various mutations that naturally exist, 
including but not limited to those disclosed herein, as well as synthetic or recombinant 
mutations produced by human intervention. The term "mutant," as applied to these 
genes, is not intended to embrace sequence variants which, due to the degeneracy of 
the genetic code, encode proteins identical to the normal sequences disclosed or 
otherwise enabled herein; nor is it intended to embrace sequence variants which, 
although they encode different proteins, encode proteins which are functionally 
equivalent to normal proteins. 

Substantially pure As used herein with respect to proteins (including 
antibodies) or other preparations, the term "substantially pure" means that the 
20 preparation is essentially free of other substances to an extent practical and 

appropriate for its intended use. In particular, a protein preparation is substantially 
pure if it is sufficiently free from other biological constituents so as to be useful in, for 
example, generating antibodies, sequencing, or producing pharmaceutical 
preparations. By techniques well known in the art, substantially pure proteins or 
peptides may be produced in light of the nucleic acid and amino acid sequences 
disclosed herein. In particular, in light of the nucleic acid and amino acid sequences 
disclosed herein, one of ordinary skill in the art may, by application or serial 
application of well-known methods including HPLC or immune-affinity 
chromatography or electrophoretic separation, obtain proteins or peptides of any 
generally feasible purity. Preferably, but not necessarily, "substantially pure" 



25 
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preparations include at least 60% by weight (dry weight) the compound of interest. 
More preferably the preparation is at least 75% or 90%, and most preferably at least 
99%, by weight the compound of interest. Purity can be measured by any appropriate 
method, e.g., column chromatography, gel electrophoresis, or HPLC analysis. With 
5 respect to proteins, including antibodies, if a preparation includes two or more 

different compounds of interest (e.g., two or more different antibodies, immunogens, 
functional domains, or other polypeptides of the invention), a "substantially pure" 
preparation is preferably one in which the total weight (dry weight) of all the 
compounds of interest is at least 60% of the total dry weight. Similarly, for such 

10 . preparations containing two or more compounds of interest, it is preferred that the 
total weight of the compounds of interest be at least 75%, more preferably at least 
90%, and most preferably at least 99%, of the total dry weight of the preparation. 
Finally, in the event that the protein of interest is mixed with one or more other 
proteins (e.g., serum albumin) or compounds (e.g., diluents, excipients, salts, 

15 polysaccharides, sugars, lipids) for purposes of administration, stability, storage, and 
the like, such other proteins or compounds may be ignored in calculation of the purity 
of the preparation. 

Isolated nucleic acid. As used herein, an "isolated nucleic acid" is a 
ribonucleic acid, deoxyribonucleic acid, or nucleic acid analog comprising a 

20 polynucleotide sequence that is isolated or separate from sequences that are 

immediately contiguous (one on the 5' end and one on the 3 f end) in the naturally 
occurring genome of the organism from which it is derived. The term therefore 
includes, for example, a recombinant nucleic acid which is incorporated into a vector, 
into an autonomously replicating plasmid or virus, or into the genomic DNA of a 

25 prokaryote or eukaryote; or which exists as a separate molecule (e.g., a cDNA or a 
genomic DNA fragment produced by PCR or restriction endonuclease treatment) 
independent of other sequences. It also includes a recombinant DNA which is part of 
a hybrid gene encoding additional polypeptide sequences and/or including exogenous 
regulatory elements. 
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Substantially identical sequence As used herein, a "substantially 
identical" amino acid sequence is an amino acid sequence which differs only by 
conservative amino acid substitutions, for example, substitution of one amino acid for 
another of the same class (e.g., valine for glycine, arginine for lysine, etc.) or by one 
or more non-conservative substitutions, deletions, or insertions located at positions of 
the amino acid sequence which do not destroy the function of the protein (assayed, 
e.g., as described herein). Preferably, such a sequence is at least 85%, more 
preferably 90%, and most preferably 95% identical at the amino acid level to the 
sequence of the protein or peptide to which it is being compared. For nucleic acids, 
the length of comparison sequences will generally be at least 50 nucleotides, 
preferably at least 60 nucleotides, more preferably at least 75 nucleotides, and most 
preferably 1 10 nucleotides. A "substantially identical" nucleic acid sequence codes 
for a substantially identical amino acid sequence as defined above. 

Transformed 0611 M used herein, a "transformed cell" is a cell into which 
(or into an ancestor of which) has been introduced, by means of recombinant DNA 
techniques, a nucleic acid molecule of interest. The nucleic acid of interest will 
typically encode a peptide or protein. The transformed cell may express the sequence 
of interest or may be used only to propagate the sequence. The term "transformed" 
may be used herein to embrace any method of introducing exogenous nucleic acids 
20 including, but not limited to, transformation, transfection, electroporation, 
microinjection, viral-mediated transfection, and the like. 

Operably joined, M used herein » a c <x«ng sequence and a regulatory region are 
said to be "operably joined" when they are covalently linked in such a way as to place 
the expression or transcription of the coding sequence under the influence or control 
of the regulatory region. If it is desired that the coding sequences be translated into a 
functional protein, two DNA sequences are said to be operably joined if induction of 
promoter function results in the transcription of the coding sequence and if the nature 
of the linkage between the two DNA sequences does not (1) result in the introduction 
of a frame-shift mutation, (2) interfere with the ability of the regulatory region to 
direct the transcription of the coding sequences, or (3) interfere with the ability of the 
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corresponding RNA transcript to be translated into a protein. Thus, a regulatory 
region would be operably joined to a coding sequence if the regulatory region were 
capable of effecting transcription of that DNA sequence such that the resulting 
transcript might be translated into the desired protein or polypeptide. 
5 Stringent hybridization conditions. Stringent hybridization conditions is a term 

of art understood by those of ordinary skill in the art. For any given nucleic acid 
sequence, stringent hybridization conditions are those conditions of temperature, 
chaotrophic acids, buffer, and ionic strength which will permit hybridization of that 
nucleic acid sequence to its complementary sequence and not to substantially different 
10 sequences. The exact conditions which constitute "stringent" conditions, depend upon 
the nature of the nucleic acid sequence, the length of the sequence, and the frequency 
of occurrence of subsets of that sequence within other non-identical sequences. By 
varying hybridization conditions from a level of stringency at which non-specific 
hybridization occurs to a level at which only specific hybridization is observed, one of 
15 ordinary skill in the art can, without undue experimentation, determine conditions 
which will allow a given sequence to hybridize only with complementary sequences. 
Suitable ranges of such stringency conditions are described in Krause and Aaronson 
(1991). Hybridization conditions, depending upon the length and commonality of a 
sequence, may include temperatures of 20 P C-65°C and ionic strengths from 5x to 0.1 x 
20 SSC. Highly stringent hybridization conditions may include temperatures as low as 
40-42°C (when denaturants such as formamide are included) or up to 60-65 °C in ionic 
strengths as low as 0. lx SSC. These ranges, however, are only illustrative and, 
depending upon the nature of the target sequence, and possible future technological 
developments, may be more stringent than necessary. Less than stringent conditions 
are employed to isolate nucleic acid sequences which are substantially similar, allelic 
or homologous to any given sequence. 

Selectively binds. As used herein with respect to antibodies, an antibody 
is said to "selectively bind" to a target if the antibody recognizes and binds the target 
of interest but does not substantially recognize and bind other molecules in a sample, 
e.g., a biological sample, which includes the target of interest. That is, the antibody 



SUBSTITUTE SHEET (RULE 26) 



W ° 97/27296 PCT/CA97/0005, 



-11- 



10 



15 



20 



25 



30 



must bind to its target with sufficient specificity so as to distinguish the target from 
essentially all of molecules which would reasonably be present in a biological sample 
including the target. 

n - The Presenilins and Pr esenilin-Interacting Proteins 

The present invention is based, in part, upon the discovery of a family of 
mammalian genes which, when mutated, are associated with the development of 
Alzheimer's Disease. The discovery of these genes, designated presenilin-1 (PS1) and 
presenilin-2 (PS2), as well as the characterization of these genes, their protein 
products, mutants, invertebrate homologues, and possible functional roles, are 
described in PCT Publication WO96/34099. The present invention is further based, in 
part, upon the discovery of a group of proteins which interact with the presenilins 
under physiological conditions and which, therefore, are believed to be involved in the 
biochemical pathways which are altered in Alzheimer's Disease. These proteins are 
referred to herein as presenilin-interacting (PS-interacting) proteins. Because 
mutations in the presenilins are known to be causative of Alzheimer's Disease, each of 
the PS-interacting genes and proteins disclosed and described herein presents a novel 
target for therapeutic intervention in Alzheimer's Disease. That is, modulation of the 
interactions of these proteins with the presenilins, or modulation of the interactions of 
at least the PS-interacting domains of these PS-interacting proteins with at least the 
interacting domains of the presenilins, provides a means of modulating the activity 
and/or availability of the presenilins, or of modulating the activity and/or availability 
of the PS-interacting proteins. Furthermore, as aberrations in the interactions of 
mutant presenilins with one or more of these PS-interacting proteins is causative of 
Alzheimer's Disease, mutations in one or more of these PS-interacting proteins are 
also likely to be causative of Alzheimer's Disease. Therefore, each of the PS- 
interacting genes and proteins disclosed and described herein presents a novel target 
for diagnosis of forms of familial and/or sporadic Alzheimer's Disease with an 
etiology independent of mutations in the presenilins. Finally, as described more fully 
below, the PS-interacting genes and proteins described and disclosed herein provide 
for new assays for compounds which affect the interactions of the presenilins and PS- 
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interacting proteins, assays for other members of the biochemical pathways involved 
in the etiology of Alzheimer's Disease, and new cell lines and transgenic animal 
models for use in such assays. 

5 1. Presenilin Processing 

Employing the antibodies and protein-binding assays described and/or 
enabled in PCT Publication WO96/34099, the processing and protein-protein 
interactions of both normal and mutant presenilins were investigated. It was found 
that mutations in the presenilins appear to lead to changes in both their intracellular 

10 processing (e.g., endoproteolytic cleavage, ubiquitination, and clearance) and their 
intracellular interactions with other proteins expressed in human brain. As described 
below, knowledge of presenilin processing and interactions, and particularly changes 
in mutant presenilin processing and interactions, provides for new diagnostic and 
therapeutic targets for Alzheimer's Disease and related disorders. 

15 Western blot analysis suggests that the normal presenilins undergo 

proteolytic cleavage to yield characteristic N- and C-terminal fragments. As noted 
above, the normal presenilin proteins have an expected molecular mass of 47-51 kDa 
depending, in part, upon mRNA splice variations, electrophoretic conditions, etc. 
Analysis of Western blots suggests, however, that the normal presenilin proteins 

20 undergo proteolytic cleavage to yield an approximately 35 kDa N-terminal fragment 
and an approximately 18 kDa C-terminal fragment. In particular, Western blots 
bearing lysates from wild-type native human fibroblasts, human neocortical brain 
tissue from control subjects, and neocortical brain tissue from non-transgenic and PS1 
transgenic mice using antibodies ("14.2") recognizing PS 1 -specific residues 1-25 at 

25 the N-terminus reveal the presence of a strong immunoreactive band of approximately 
35 kDa and, after longer exposures, a weaker band of approximately 45 kDa which 
presumably represents the full-length PS1 protein. Antibodies ("520") directed at 
residues 304-318 at the apex of the TM6->7 loop of PS1, and antibodies ("4627") 
directed at residues 457-467 in the C-terminus of PS1, both recognize the same strong 

30 band of approximately 18 kDa. Antibodies 520 also recognize a weak band of 45 kDa 
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coincident with the PS1 band detected by 14.2. Sequencing of the major C-terminal 
fragment from PSl-transfected human embryonic kidney cells (HEK 293) showed 
that the principal endoproteolytic cleavage occurs near M298 in the proximal portion 
of the TM6-+7 loop, possibly by enzymes other than the proteasome. These 
i observations suggest that an endoproteolytic cleavage event occurs near the junction 
ofexons9and lOofPSl. Full length PS 1 in these cells is quickly turned over (t m < 
60 min.) by the proteasome. 

To determine whether mutations in the presenilin proteins result in 
alterations of their proteolytic cleavage, Western blots containing lysates of fibroblast 
and neocortical brain homogenates from normal subjects and subjects carrying PS1 
mutations were investigated with the PS1 specific antibody Ab 14.2. In fibroblasts, 
there were no obvious differences in the relative intensities of the protein bands when 
lysates from heterozygous carriers of the PS1 mutations were compared with normal 
homozygotes. In contrast, there appeared to be a difference between PS1 mutation 
carriers and normals in homogenates of temporal neocortex from AD affected 
heterozygous carriers of either the PS1 A246E or C410Y mutations (which are located 
in TM6 and TM7 respectively). In heterozygotes, a strongly immunoreactive band of 
approximately 45 kDa was detected which initially appeared to correspond to the full- 
length PS1 protein. Further analysis, however, revealed that this band represents an 
alternatively processed presenilin product. A similar band corresponding to this 
mutant processed PS1 was observed in neocortical homogenates from some sporadic 
late-onset AD patients. These data suggest that (1) some pathogenic PS 1 mutations 
associated with early-onset AD alter the way in which the presenilis are processed 
through endoproteolytic and proteasome pathways and (2) the presenilin proteins, and 
changes in the processing of the presenilins in the brain, are also implicated in late- 
onset and sporadic AD. 

2 - Presenilin- Interacrinp Prr .t^„c 

In order to identify proteins which may bind to or otherwise interact with 
the presenilins inviyo, a yeast two-hybrid system was used as described below 
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(Example 1). In particular, because mutations in the TM6->7 loop domains are 
known to be causative of AD, a yeast two-hybrid system was used to identify cellular 
proteins which may interact with normal and mutant presenilin TM6-»7 loop 
domains. Yeast two-hybrid studies were also done with cDNAs corresponding to the 
5 C-terminal 18 kDa endoproteolytic cleavage fragment, and with cDNAs 

corresponding to the TM1— >2 intraluminal loop domain, which is also the site of the 
FAD associated Yl 15H missense mutation. In brief, cDNA sequences encoding the 
TM6-»7 loop (i.e., residues 266 to 409 of PS1) were ligated in-frame to the GAL4 
DNA-binding domain in the pAS2-l yeast expression plasmid vector (Clontech). 

10 This plasmid was then co-transformed into S. cerevisiae strain Y190 together with a 
library of human brain cDNAs ligated into the pACT2 yeast expression vector bearing 
the GAL4 activation domain (Clontech). After appropriate selection and re-screening, 
a number of clones were recovered and sequenced bearing human brain cDNAs 
encoding peptides which interacted with the normal presenilin TM6-»7 domain. To 

15 determine whether these presenilin interactions would be modified by AD related 

mutations within the TM6-*7 loop, the yeast two-hybrid system was again used with 
TM6->7 loop peptides containing the L286V, the L392V, and the exon 10 splicing 
mutants. When these mutant constructs were used as "bait" to re-screen the brain 
cDNA:GAL4 activation domain library, some but not all of the brain cDNA 

20 sequences which interacted with the normal presenilin were recovered. In addition, 
several new clones were identified which interacted with the mutant but not the 
normal presenilins. The clones corresponding to the PS-interacting proteins with the 
highest presenilin affinity are described in Example 1 and below. 

PS-interacting proteins, particularly those which interact selectively with 

25 either the normal or mutant presenilins, provide new targets for the identification of 
useful pharmaceuticals, new targets for diagnostic tools in the identification of 
individuals at risk, new sequences for the production of transformed cell lines and 
transgenic animal models, and new bases for therapeutic intervention in Alzheimer's 
Disease. In particular, the onset of AD may be associated with aberrant interactions 

30 between mutant presenilin proteins and normal forms of PS-interacting proteins such 
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as those identified using the methods described herein. These changes may increase 
or decrease interactions present with normal PSl or may cause interaction with a 
novel mutation-specific PS-interacting protein. In addition, however, aberrant 
interactions may result from normal presenilins binding to mutant forms of the PS- 
i interacting proteins and, therefore, mutations in the PS-interacting proteins may also 
be causative of AD. 

A - The SSa Suh unit of the. 76$ Proteasnme 

Two overlapping clones have been identified as representing a portion of 
the human protein alternatively known as Antisecretory Factor ("ASF") orthe 
Multiubiquitin chain-binding S5a subunit of the 26S proteasome ("SSa"). These 
clones, which together include residues 7£372*£S5s, were shown to interact with 
n^raMlprMmiliii TM6-+7 loop domain but only weakly with two TM6->7 loop 
domain mutants tested (L286V, L392V). The PSl :S5a interaction was confirmed by 
co-immunoprecipitation studies, and immunocytochemical studies showed SSa and 
PSl are expressed in contiguous intracellular compartments in brain cells typically 
affected by AD. 

The interaction between PSl and the proteasome could be relevant to the 
pathogenesis of Alzheimer's Disease (AD) through several possible mechanisms. 
First, most mammalian cells seem to maintain very low levels of the PSl holoprotein. 
A notable exception to this are cells expressing the PSl A290-3 19 splicing mutation, 
which results in a mutant PS 1 holoprotein which is not endoproteolytically cleaved 
and which is, therefore, readily detectable. In the case of the A290-3 19 splicing 
mutation at least, the presence of the mutant PSl holoprotein, or the absence or 
reduction in the 35 kDa N-terminal and 1 8 kDa C-terminal fragments, appears 
sufficient to cause AD. It is possible, therefore, that even very subtle changes in the 
turnover of the mutant PSl holoprotein might have significant pathophysiological 
effects. Thus, mutations in either the presenilins or S5a which perturb the PSl :S5a 
interaction in the mammalian CNS may cause the presenilin holoprotein to be 
aberrantly processed and cause AD. Therefore, modulation of presenilin proteolytic 
pathways might be applied therapeutically to enhance removal of mutant holoprotein. 
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To assess a potential in vivo relationship between PS1 and the S5a subunit 
of the 26S proteasome, the effects of proteasome inhibitors on PS1 metabolism were 
investigated. Short term organotypic cultures of neonatal rat hippocampus and 
carcinoma of colon (CaCo2) cells (which express high levels of both PS1 and PS2) 
5 were administered either the specific, reversible proteasome inhibitor N-acetyl- 

leucinyl-leucinyl-norleucinyl-H (LLnL) (Rock et al., 1994), or the specific irreversible 
proteasome inhibitor lactacystin (Fenteany et al., 1995). Both agents caused an 
increase in the steady state levels of PS1 holoprotein. Both agents also prolonged the 
half-life of the PS1 holoprotein in pulse chase experiments in hippocampal slices from 

10 -15 minutes to -35 minutes. As noted above, the PS1 holoprotein appears to be 
rapidly turned over in normal cells. However, even after four hours of metabolic 
labelling, neither of the proteasome inhibitors affected the level of the 35 kDa N- 
terminal PS1 fragment, or resulted in the appearance of novel species. These studies 
imply that the majority of the PS1 holoprotein is catabolized directly via a rapid, 

15 proteasome dependent pathway in a manner similar to several other integral 

membrane proteins (e.g. Sec61 and CFTR). On the other hand, because the -35 kDa 
and - 1 8 kDa terminal fragments are still produced in the presence of proteasome 
inhibitors, this endoproteolytic cleavage of PS I is probably not mediated by the 
proteasome pathway. Therefore, it appears that at least two proteolytic pathways act 

20 upon the PS1 holoprotein. 

An alternate possibility is that mutant PS1 :S5a interactions may modify 
the function or the cellular regulation of S5a. To address this possibility, S5a levels 
were examined by Western blotting of lysates from postmortem temporal neocortex 
from non-AD neurologic controls (n = 8), sporadic AD (n = 8) and PS 1 -linked FAD 

25 (n = 4). In the majority of non-AD brains, polyclonal anti-S5a antibodies specifically 
detected an S5a species with Mr of ~ 50 kDa, which could be abolished by 
preabsorption of the antibody with recombinant His 6 -S5a or with extracts of mvc -SSa 
transfected cells. In a subset of these control cases an additional S5a reactive band 
was observed at ~34 kDa. In contrast, in tissue from all subjects with sporadic late 

30 onset AD, the predominant S5a reactive species was observed at - 40 kDa which was 
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not seen in control tissue. The origin, and the functional significance of this altered 
electrophoretic mobility is unclear but indicates that S5a processing is altered in AD 
brains, irrespective of whether the AD is presenilin-linked or sporadic. 

Thus, the presenilin-proteasome interaction appears significant in several 
respects. First, the facts that the normal presenilin TM6->7 loop domain interacts 
with the S5a protein, that the mutant presenilin TM6-+7 loop domains fail to interact 
(or interact very weakly) with the S5a protein, that presenilins bearing mutations in 
the TM6-+7 loop domain appear to be differently cleaved and multiubiquitinated, that 
proteasomes are known to be involved in the cleavage and clearance of a variety of 
proteins (particularly multiubiquitinated proteins), that inhibition of proteasome 
activity inhibits cleavage of the presenilin nonproteins, and that S5a processing is 
altered in AD brains, all suggest (1) that the S5a subunit and the 26S proteasome are 
involved in the normal processing of the presenilins and that mutations which disrupt 
this normal interaction may be responsible for the abnormal processing observed in 
TM6->7 loop domain mutants; or (2) that the presenilin-proteasome interaction may 
modulate the activity of PSl, S5a, or both, with or without involving proteasome- 
mediated presenilin processing; or (3) that modulation of the normal quality control 
function of proteasome-mediated degradation of misfolded or mutant membrane 
proteins trafficking through the ER and Golgi (such as APP, Notch, or Prion proteins), 
and of misfolded, mutant, or ubiquitinated cytoplasmic proteins (including structural 
proteins such as tau, and short lived, proteasome processed signaling molecules such 
as NFkB). Thus, defective proteasome function might selectively cause these proteins 
(especially PAPP, tau, Prion) to be aberrantly metabolized. The latter would lead to 
the accumulation of neurotoxic, amyloidogenic protease-resistant derivatives such as 
AP and PrPsc, the accumulation of neurofibrillary tangles, and defective intracellular 
signaling functions. In support of these hypotheses, it should be noted that failure to 
clear hyperubiquitinated phosphorylated tau and other microtubule associated proteins 
is a prominent feature of Alzheimer's Disease (Kosik and Greenberg, 1994), 
suggesting a possible link between TM6-»7 loop domain mutants, presenilin- 
proteasome interactions, tau-proteasome interactions, and the neurofibrillary tangles 
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of tau protein in AD brains. Finally, proteasomes are known to be capable of 
degrading APP and of binding the A0 peptides which are associated with Alzheimer's 
Disease, suggesting a possible link between TM6— >1 loop domain mutants, 
presenilin-proteasome interactions, APP-proteasome interactions, and the amyloid 
5 plaques characteristic of AD brains. Furthermore, administration of proteasome 
inhibitors such as LLnL and Lactacystin cause severe disturbances in pAPP 
metabolism with increases in intracellular immature N-glycosylated PAPP, and the 
secretion of much larger amounts of AP 42 isoforms into the media (Klafki, et al., 
1996). 

10 Therefore, presenilin processing and the presenilin-proteasome interaction 

are clear targets for the diagnosis as well as therapeutic intervention in AD. Thus, as 
described below, assays may now be provided for drugs which affect the proteasome- 
mediated cleavage of the presenilins, which affect the alternative endoproteolytic 
cleavage and ubiquitination of the mutant presenilins, or which otherwise affect the 

15 processing and trafficking of the presenilins or the S5a subunit of the proteasome. In 
addition, as mutations in the 26S proteasome which disrupt the normal processing of 
the presenilins are likely to be causative of Alzheimer's Disease, additional diagnostic 
assays are provided for detecting mutations in the S5a or other subunits of the 
proteasome. Finally, additional transformed cell lines and transgenic models may 

20 now be provided which have been altered by the introduction of a normal or mutant 
sequence encoding at least a functional domain of the proteasome. The appearance of 
abnormal electrophoretic forms of S5a (and/or other proteasome subunits) in biologic 
tissues and fluids can be used as a clinical test for diagnosis and monitoring of disease 
activity in subjects with sporadic forms of AD. 

25 B. GT24: A Protein with "Armadillo" Repeats 

Another PS-interacting protein, designated GT24, was identified from 
several over- lapping clones obtained using a PS1 266-W >9 domain as bait in the yeast two- 
hybrid system and a human adult brain cDNA library. Six longer GT24 clones of 
-3.8 kb in size were subsequently obtained by screening of conventional cDNA 

30 libraries. The open reading frame within the longest GT24 clone obtained to date 
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(Accession number U81004) suggests that GT24 is a protein of at least 1040 amino 
acids with a unique N-terminus, and considerable homology to several armadillo 
(arm) repeat proteins at its C-terminus. Thus, for example, residues 440-862 of GT24 
(numbering from Accession number U8I004) have 32-56% identity (p=1.2e 135 ) to 
residues 440-854 of murine pl20 protein (Accession number Zl 7804), and residues 
367-815 of GT24 have 26-42% identity (p=0.0017) to residues 245-465 of the D, 
melanogaster armadillo segment polarity protein (Accession number PI 8824). The 
GT24 gene maps to chromosome 5pl5 near the anonymous microsatellite marker 
D5S748 and the Cri-du-Chat syndrome locus. 

Hybridization of unique 5' sequences of GT24 to Northern blots reveals 
that the GT24 gene is expressed as a range of transcripts varying in size between -3.9 
and 5.0 kb in several regions of human brain, and in several non-neurologic tissues 
such as heart. In addition, in situ hybridization studies using a 289 bp single copy 
fragment from the 5' end of GT24 in four month old murine brain reveal GT24 
transcription closely parallels that of PS1, with robust expression in dentate and 
hippocampal neurons, in scattered neocortical neurons, and in cerebellar Purkinje 
cells. In day E13 murine embryos, GT24 is widely expressed at low levels, but is 
expressed at somewhat higher levels in somites and in the neural tube. A 
physiological mvivo interaction between GT24 and PS1 is supported by co- 
immunoprecipitation studies in HEK293 cells transiently transfected with a wild type 
human PS1 cDNA, a fcjrjyc-tagged cDNA encoding residues 484-1040 of GT24 
(including the C-terminal arm. repeats), or both cDNAs. Cell lysates were 
immunoprecipitated with anti-PSl antibodies and then investigated for the presence of 
the myc : GT24 protein by immuno-blotting. In PSl/myc-GT24 double transfected 
cells, the immunoprecipitates contained a robust anti-myc reactive band of Mr -60 
kDa, which co-migrated with a myc-GT24 control. In cells transfected with myc- 
GT24 only, a very weak band was detected after long exposures, presumably 
reflecting interaction of me myc-GT24 with low levels of endogenous PS I. No myc- 
reactive bands were detected in cells transfected with PS1 alone, or in any of the 
transfected cells immunoprecipitated with pre-immune serum. Taken together, these 
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observations strongly suggest that the observed PS 1 :GT24 interaction is 
physiologically relevant. 

To explore whether mutations in the TM6-TM7 loop of PS1 might 
influence the PS1:GT24 interaction, we employed quantitative liquid P-galactosidase 
5 assays to directly compare the yeast-two-hybrid interaction of the C-terminal residues 
499-1040 of GT24 with wildtype and mutant PS1 2<MM09 . These studies revealed that 
the interaction of GT24 499 . I040 with a L286V mutant PS1 domain was not significantly 
different from the interaction with the corresponding wild type PSi domain. In 
contrast, there was a significant reduction in the GT24 499 . 1040 interaction with the 
10 L392V mutant PS 1 construct. The absence of an effect of the L286V mutation, and 
the presence of an effect with the L392V mutation, may suggest that some mutations 
may effect PS 1 :GT24 binding, while others may modulate the PSI response to GT24 
binding. 

The PS 1 :GT24 interaction could support several functions. The arm repeat 
15 motif of GT24 has been detected in several proteins with diverse functions including 
P-catenin and its invertebrate homologue armadillo, plakoglobin, pi 20, the 
adenomatous polyposis coli (APC) gene, suppressor of RNA polymerase 1 in yeast 
(SRP1), and smGDS. For example, p-catenin, pi 20 and plakoglobin play an essential 
role in intercellular adhesion. P-catenin /armadillo is involved in transduction of 
20 wineless/Wnt signals during cell fate specification, and P-catenin and pi 20 may play 
a role in other receptor mediated signal transduction events including responses to 
trophic factors such as PDGF, EGF, CSF-1 and NGF. 

If the PS 1 :GT24 interaction is part of intercellular signaling pathways for 
trophic factors, or is involved in cell-cell adherence, disruption of the interaction may 
25 be involved in the neurodegenerative processes in PS-linked FAD brains, and in the 
increased sensitivity of PSI or PS2 transfected cells to apoptosis (Wolozin et al., 
1996). It is of note that at least one ami protein, smGDS, stimulates GDP/GTP 
exchange on intracellular G-proteins (Kikuchi et al. 1992; Borguski et al., 1993), and 
that mutant forms of both PAPP and PS2 are thought to activate programmed cell 
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death pathways through mechanisms involving heterotrimeric GTP/GDP proteins 
(Wolozin et al, 1996; Okamoto, et al., 1995; Yamatsuji, et al, 1996). 

The interaction between PS1 and GT24 may also be involved in some of 
the developmental phenotypes associated with homozygous PS1 knockouts in mice 
such as failed somitogenesis of the caudal embryo, short tail, and fatal cerebral 
hemorrhage at around day E 13.5 (Wong et al., 1996). The resemblance of these 
skeletal phenotypes to those associated with null mutations in PAX1 and Notch , and 
the apparent suppressor effect of mutations in sell2 on Notch/linl2 mediated 
signaling in C. elegans suggest that the PS proteins function in the Notch signaling 
pathway. In addition, mice homozygous for a knockout of the Wnt-3a gene (Takada 
et al., 1994), and murine homozygotes for a spontaneous mutation, "vestigial tail" or 
yt, in the WjrtOa gene (Greco et al., 1996), have skeletal phenotypes of defective 
caudal somite and tail bud formation. The Wpt-3a knockouts are embryonic lethal by 
day 12.5. These phenotypes are similar to those of homozygous knockouts of the 
murine PS1 gene (Wong et al., 1996). The observation that GT24 binds to PSl, is 
expressed in embryonic somites, and contains the armadillo repeat motif of other 
proteins used in the downstream signaling in the Wineless/Wnt pathway suggests that 
PSl is a downstream element in the GT24 -Win e le SS /Wnt pathway. This can be 
exploited to create a bioassay for drugs affecting the GT24-PS1 interaction directly, or 
affecting upstream or downstream components of that interaction pathway, and can 
therefore be used to monitor the effects of presenilin mutations. For example, cells 
transfected with normal or mutant presenilis may be exposed to soluble Wnt-3a 
protein (or other Wnt proteins such as Wnt-1) and assayed for changes which are 
specific to the Wingless/Wnt signaling pathway, or for any of the other changes 
described herein for cell assays (e.g., intracellular ion levels, Ap processing, 
apoptosis, etc.). 

Thus, the GT24 protein also presents new targets for diagnosis as well as 
therapeutic intervention in AD. For example, as mutations in the GT24 protein may 
also be causative of Alzheimer's Disease, additional diagnostic assays are provided for 
detecting mutations in these sequences. Similarly, additional transformed cell lines 
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and transgenic models may now be provided which have been altered by introduction 
of a normal or mutant nucleic acid encoding at least a functional domain of the GT24 
protein, and particularly the functional domains (e.g., residues 70-377) which interact 
with the presenilins. Such transformed cells and transgenics will have utility in assays 
5 for compounds which modulate the presenilin-GT24 interactions. 

C. pQ071: A Protein with "Armadillo" Repeats 

Another independent clone isolated in the initial screening with the wild 
type PS 1 266-409 "bait" also encodes a peptide with C-terminal arm repeats (clone 
Y2H25, Accession number U81005). A longer cDNA sequence corresponding to the 

10 Y2H25 clone has been deposited with GenBank as human protein p0071 (Accession 
number X81889) and is reproduced herein as SEQ ID NO: 5. Clone Y2H25 
corresponds essentially to nucleotide positions 1682-1994 of SEQ ID NO: 5. 
Comparison of the predicted sequence of the Y2H25/p0071 ORF with that of GT24 
confirms that they are related proteins with 47% overall amino acid sequence identity, 

15 and with 70% identity between residues 346-862 of GT24 and residues 509-1022 of 
p0071. This suggests that PS1 interacts with a novel class of arm repeat containing 
proteins. The broad --4.5 kb hybridization signal obtained on Northern blots with the 
unique 5' end of GT24 could reflect either alternative splicing/polyadenylation of 
GT24 or, less likely, the existence of additional members of this family with higher 

20 degrees of N-terminal homology to GT24 than p0071 . Cells transformed with these 
sequences, or transgenic animals including these sequences, will have additional 
utility as animal models of AD and for use in screening for compounds which 
modulate the action of normal and mutant presenilins. 

D. Rab 11 

25 One clone (Y2H9), disclosed herein as SEQ ID NO: 5, was identified as 

interacting with the normal PS1 TM6-*7 loop domain and appears to correspond to a 
known gene, Rabl 1, available through Accession numbers X56740 and X53143. 
Rabl 1 is believed to be involved in protein/vesicle trafficking in the ER/Golgi. Note 
the possible relationship to processing of membrane proteins such as fiAPP and Notch 
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with resultant overproduction of toxic AB peptides (especially neurotoxic AB W , 
isoforms) (Scheuner, et al, 1995). 

E- Retinoid X Receptor. ft 

One clone (Y2H23b), disclosed herein as SEQ ID NO: 6, was identified as 
interacting with the normal PS1 TM6-»7 loop domain and appears to correspond to a 
known gene, known variously as the retinoid X receptor-p, nuclear receptor co- 
regulator, or MHC Class I regulatory element, and is available through Accession 
numbers M84820, X63522 and M81766. This gene is believed to be involved in 
intercellular signaling, suggesting a possible relationship to the intercellular signaling 
function mediated by C^elegans sell2 and Notch/lin-12 (transcription activator). 
F. Cytoplasmic Chaperonin 

One clone (Y2H27), disclosed herein as SEQ ID NO: 8, was identified as 
interacting with the normal PS1 TM6->7 loop domain and appears to correspond to a 
known gene, a cytoplasmic chaperonin containing TCP-1, available through 
15 Accession numbers Ul 7 1 04 and X74801 . 
G- Clone Y2H3S 

One clone (Y2H35), disclosed herein as SEQ ID NO: 7, was identified as 
interacting with the normal PS1 TM6->7 loop domain and appears to correspond to a 
sequence that codes for a protein of unknown function, available through Accession 
number R12984, but which displays evolutionary conservation in yeast sequences. 
H. Clone Y2H1 71 

One clone (Y2H171), disclosed herein as SEQ ID NO: 9, was identified as 
interacting with the normal PS 1 TM6->7 loop domain and appears to correspond to a 
known expressed repeat sequence available through Accession number D55326. 
25 I. Clone Y2H41 

One clone (Y2H41) was identified which reacts strongly with the TM6->7 
loop domains of both PS1 and PS2 as well as the mutant loop domains of PS 1. The 
sequence, disclosed as SEQ ID NO: 10, shows strong homology to an EST of 
unknown function (Accession number T64843). 



20 
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III. Preferred Embodiments 

Based, in part, upon the discoveries disclosed and described herein, the 
following preferred embodiments of the present invention are provided. 

5 

1. Isolated Nucleic Acids 

In one series of embodiments, the present invention provides isolated 
nucleic acids corresponding to, or relating to, the nucleic acid sequences disclosed 
herein, which encode at least the PS-interacting domain of a PS-interacting protein. 

10 As described more fully below, the disclosed and enabled sequences include normal 
sequences from humans and other mammalian species, mutant sequences from 
humans and other mammalian species, homologous sequences from non-mammalian 
species such as Drosophila and C. eleeans. subsets of these sequences useful as probes 
and PCR primers, subsets of these sequences encoding fragments of the PS-interacting 

15 proteins or corresponding to particular structural domains or polymorphic regions, 
complementary or antisense sequences corresponding to fragments of the PS- 
interacting protein genes, sequences in which the PS-interacting protein coding 
regions have been operably joined to exogenous regulatory regions, and sequences 
encoding fusion proteins in which portions of the PS-interacting proteins are fused to 

20 other proteins useful as markers of expression, as "tags" for purification, or in screens 
and assays for other proteins which interact with the PS-interacting proteins. 

Thus, in a first series of embodiments, isolated nucleic acid sequences are 
provided which encode at least a PS-interacting domain of a normal or mutant version 
of a PS-interacting protein. Examples of such nucleic acid sequences are disclosed 

25 herein as SEQ ID NOs: 1,3, and 5-10. In addition, given the sequences of the PS- 
interacting domains of the PS-interacting proteins disclosed herein, one of ordinary 
skill in the art is clearly enabled to obtain the entire genomic or cDNA sequence 
encoding the entire PS-interacting proteins. Thus, for example, based upon the initial 
clone of the GT24 protein obtained using the yeast two-hybrid system (Example 1), 

30 the larger GT24 clone disclosed as SEQ ID NO: 3 was obtained by standard methods 
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known in the art. Complete cDNA or genomic clones of each of the genes encoding 
the disclosed sequences may be similarly obtained by one of ordinary skill in the art. 
Therefore, the present invention provides complete genomic sequences as well as 
cDNA sequences corresponding to the PS-interacting protein genes of the invention. 
Alternatively, the nucleic acids of the invention may comprise recombinant genes or 
"minigenes" in which all or some introns of the PS-interacting protein genes have 
been removed, or in which various combinations of introns and exons and local ds- 
acting regulatory elements have been engineered in propagation or expression 
constructs or vectors. For purposes of reducing the size of a recombinant PS- 
interacting protein gene, a cDNA gene may be employed, or various combinations of 
introns and untranslated exons may be removed from a DNA construct. These and 
many variations on these embodiments are now enabled by the identification and 
description of the PS-interacting proteins provided herein. 

In addition to the disclosed PS-interacting protein and gene sequences, one 
of ordinary skill in the art is now enabled to identify and isolate nucleic acids 
representing PS-interacting genes or cDNAs which are allelic to the disclosed 
sequences or which are heterospecific homologues. Thus, the present invention 
provides isolated nucleic acids corresponding to these alleles and homologues, as well 
as the various above-described recombinant constructs derived from these sequences, 
by means which are well known in the art. Briefly, one of ordinary skill in the art 
may now screen preparations of genomic or cDNA, including samples prepared from 
individual organisms (e.g., human AD patients or their family members) as well as 
bacterial, viral, yeast or other libraries of genomic or cDNA, using probes or PCR 
primers to identify allelic or homologous sequences. Because it is desirable to 
identify mutations in the PS-interacting proteins which may contribute to the 
development of AD or other disorders, because it is desirable to identify 
polymorphisms in the PS-interacting proteins which are not pathogenic, and because 
it is also desirable to create a variety of animal models which may be used to study 
AD and screen for potential therapeutics, it is particularly contemplated that additional 
PS-interacting protein sequences will be isolated from other preparations or libraries 
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of human nucleic acids and from preparations or libraries from animals including rats, 
mice, hamsters, guinea pigs, rabbits, dogs, cats, goats, sheep, pigs, and non-human 
primates. Furthermore, PS-interacting protein homologues from yeast or invertebrate 
species, including C. elegans and other nematodes, as well as Drosophila and other 
5 insects, may have particular utility for drug screening. 

Standard hybridization screening or PCR techniques may be employed (as 
used, for example, in the identification of the mPS 1 gene disclosed in PCT 
Publication WO96/34099) to identify and/or isolate such allelic and homologous 
sequences using relatively short PS-interacting protein gene sequences. The 

10 sequences may include 8 or fewer nucleotides depending upon the nature of the target 
sequences, the method employed, and the specificity required. Future technological 
developments may allow the advantageous use of even shorter sequences. With 
current technology, sequences of 9-50 nucleotides, and preferably about 1 8-24 are 
preferred. These sequences may be chosen from those disclosed herein, or may be 

15 derived from other allelic or heterospecific homologues enabled herein. When 
probing mRNA or screening cDNA libraries, probes and primers from coding 
sequences (rather than introns) are preferably employed, and sequences which are 
omitted in alternative splice variants typically are avoided unless it is specifically 
desired to identify those variants. Allelic variants of the PS-interacting protein genes 

20 may be expected to hybridize to the disclosed sequences under stringent hybridization 
conditions, as defined herein, whereas lower stringency may be employed to identify 
heterospecific homologues. 

In another series of embodiments, the present invention provides for 
isolated nucleic acids which include subsets of the PS-interacting protein sequences or 

25 their complements. As noted above, such sequences will have utility as probes and 
PCR primers in the identification and isolation of allelic and homologous variants of 
the PS-interacting protein genes. Subsequences corresponding to polymorphic 
regions of the PS-interacting proteins, will also have particular utility in screening 
and/or genotyping individuals for diagnostic purposes, as described below. In 

30 addition, and also as described below, such subsets will have utility for encoding (1 ) 
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fragments of the PS-interacting proteins for inclusion in fusion proteins, (2) fragments 
which comprise functional domains of the PS-interacting proteins for use in binding 
studies, (3) fragments of the PS-interacting proteins which may be used as 
immunogens to raise antibodies against the PS -interacting proteins, and (4) fragments 
of the PS-interacting proteins which may act as competitive inhibitors or as mimetics 
of the PS-interacting proteins to inhibit or mimic their physiological functions. 
Finally, such subsets may encode or represent complementary or antisense sequences 
which can hybridize to the PS -interacting protein genes or PS-interacting protein 
mRNA transcripts under physiological conditions to inhibit the transcription or 
translation of those sequences. Therefore, depending upon the intended use, the 
present invention provides nucleic acid subsequences of the PS-interacting protein 
genes which may have lengths varying from 8-10 nucleotides (e.g., for use as PCR 
primers) to nearly the full size of the PS-interacting protein genomic or cDNAs. 
Thus, the present invention provides isolated nucleic acids comprising sequences 
corresponding to at least 8-10, preferably 15, and more preferably at least 20 
consecutive nucleotides of the PS-interacting protein genes, as disclosed or otherwise 
enabled herein, or to their complements. As noted above, however, shorter sequences 
may be useful with different technologies. 

In another series of embodiments, the present invention provides nucleic 
acids in which the coding sequences for the PS-interacting proteins, with or without 
introns or recombinantly engineered as described above, are operably joined to 
endogenous or exogenous 5' and/or 3' regulatory regions. Using the present disclosure 
and standard genetic techniques (e.g., PCR extensions, targeting gene walking), one of 
ordinary skill in the art is now enabled to clone the 5' and/or 3 1 endogenous regulatory 
regions of any of the disclosed PS-interacting protein genes. Similarly, allelic 
variants of these endogenous regulatory regions, as well as endogenous regulatory 
regions from other mammalian homologues, are similarly enabled without undue 
experimentation. Alternatively, exogenous regulatory regions (i.e., regulatory regions 
from a different conspecific gene or a heterospecific regulatory region) may be 
operably joined to the PS -interacting protein coding sequences in order to drive 



SUBSTITUTE SHEET (RULE 26) 



WO 97/27296 



PCT/CA97/00051 



-28- 



expression. Appropriate 5 ! regulatory regions will include promoter elements and 
may also include additional elements such as operator or enhancer sequences, 
ribosome binding sequences, RNA capping sequences, and the like. The regulatory 
region may be selected from sequences that control the expression of genes of 
5 prokaryotic or eukaryotic cells, their viruses, and combinations thereof. Such 

regulatory regions include, but are not limited to, the lac system, the trp system, the 
tac system, and the trc system; major operator and promoter regions of phage X\ the 
control region of the fd coat protein; early and late promoters of SV40; promoters 
derived from polyoma, adenovirus, retrovirus, baculovirus, and simian virus; 3- 

10 phosphoglycerate kinase promoter; yeast acid phosphatase promoters; yeast alpha- 
mating factors; promoter elements of other eukaryotic genes expressed in neurons or 
other cell types; and combinations thereof. In particular, regulatory elements may be 
chosen which are inducible or repressible (e.g., the P-galactosidase promoter) to allow 
for controlled and/or manipulable expression of the PS-interacting protein genes in 

15 cells transformed with these nucleic acids. Alternatively, the PS-interacting protein 
coding regions may be operably joined with regulatory elements which provide for 
tissue specific expression in multicellular organisms. Such constructs are particularly 
useful for the production of transgenic organisms to cause expression of the PS- 
interacting protein genes only in appropriate tissues. The choice of appropriate 

20 regulatory regions is within the ability and discretion of one of ordinary skill in the art 
and the recombinant use of many such regulatory regions is now established in the art. 

In another series of embodiments, the present invention provides for 
isolated nucleic acids encoding all or a portion of the PS-interacting proteins in the 
form of a fusion protein. In these embodiments, a nucleic acid regulatory region 

25 (endogenous or exogenous) is operably joined to a first coding region which is 
co valently joined in-frame to a second coding region. The second coding region 
optionally may be covalently joined to one or more additional coding regions and the 
last coding region is joined to a termination codon and, optionally, appropriate 3' 
regulatory regions (e.g., polyadenylation signals). The PS-interacting protein 

30 sequences of the fusion protein may represent the first, second, or any additional 
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coding regions. The PS-interacting protein sequences may be conserved or non- 
conserved domains and can be placed in any coding region of the fusion. The non- 
PS-interacting protein sequences of the fusion may be chosen according to the needs 
and discretion of the practitioner and are not limited by the present invention. Useful 
5 non-PS-interacting protein sequences include, for example, short sequence "tags" such 
as antigenic determinants or poly-His tags which may be used to aid in the 
identification or purification of the resultant fusion protein. Alternatively, the non- 
PS-interacting protein coding region may encode a large protein or protein fragment, 
such as an enzyme or binding protein which also may assist in the identification and 

10 purification of the protein, or which may be useful in an assay such as those described 
below. Particularly contemplated fusion proteins include poly-His and GST 
(glutathione S-transferase) fusions which are useful in isolating and purifying the 
presenilins-interacting proteins, and the yeast two hybrid fusions, described below, 
which are useful in assays to identify other proteins which bind to or interact with the 

15 PS-interacting proteins. 

In another series of embodiments, the present invention provides isolated 
nucleic acids in the form of recombinant DNA constructs in which a marker or 
reporter gene (e.g., fJ-galactosidase, luciferase) is operably joined to the 5' regulatory 
region of a PS-interacting protein gene such that expression of the marker gene is 
under the control of those regulatory sequences. Using the PS-interacting protein 
regulatory regions enabled herein, including regulatory regions from human and other 
mammalian species, one of ordinary skill in the art is now enabled to produce such 
constructs. As discussed more fully below, such isolated nucleic acids may be used to 
produce cells, cell lines or transgenic animals which are useful in the identification of 
compounds which can, directly or indirectly, differentially affect the expression of the 
PS-interacting proteins. 

Finally, the isolated nucleic acids of the present invention include any of 
the above described sequences when included in vectors. Appropriate vectors include 
cloning vectors and expression vectors of all types, including plasmids, phagemids, 
cosmids, episomes, and the like, as well as integration vectors. The vectors may also 
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include various marker genes (e.g., antibiotic resistance or susceptibility genes) which 
are useful in identifying cells successfully transformed therewith. In addition, the 
vectors may include regulatory sequences to which the nucleic acids of the invention 
are operably joined, and/or may also include coding regions such that the nucleic 
5 acids of the invention, when appropriately ligated into the vector, are expressed as 
fusion proteins. Such vectors may also include vectors for use in yeast "two hybrid," 
baculovirus, and phage-display systems. The vectors may be chosen to be useful for 
prokaryotic, eukaryotic or viral expression, as needed or desired for the particular 
application. For example, vaccinia virus vectors or simian virus vectors with the 

10 SV40 promoter (e.g., pSV2), or Herpes simplex virus or adeno-associated virus may 
be useful for transfection of mammalian cells including neurons in culture or in vivo. 
and the baculovirus vectors may be used in transfecting insect cells (e.g., butterfly 
cells). A great variety of different vectors are now commercially available and 
otherwise known in the art, and the choice of an appropriate vector is within the 

15 ability and discretion of one of ordinary skill in the art. 

2. Substantially Pure Proteins 

The present invention provides for substantially pure preparations of the 
PS-interacting proteins, fragments of the PS-interacting proteins, and fusion proteins 

20 including the PS-interacting proteins or fragments thereof. The proteins, fragments 
and fusions have utility, as described herein, in the generation of antibodies to normal 
and mutant PS-interacting proteins, in the identification of proteins (aside from the 
presenilins) which bind to the PS -interacting proteins, and in diagnostic and 
therapeutic methods. Therefore, depending upon the intended use, the present 

25 invention provides substantially pure proteins or peptides comprising amino acid 

sequences which are subsequences of the complete PS-interacting proteins and which 
may have lengths varying from 4-10 amino acids (e.g., for use as immunogens), or 10- 
100 amino acids (e.g., for use in binding assays), to the complete PS-interacting 
proteins. Thus, the present invention provides substantially pure proteins or peptides 

30 comprising sequences corresponding to at least 4-5, preferably 6-10, and more 
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preferably at least 50 or 100 consecutive amino acids of the PS-interacting proteins, as 
disclosed or otherwise enabled herein. 

The proteins or peptides of the invention may be isolated and purified by 
any of a variety of methods selected on the basis of the properties revealed by their 
protein sequences. For example, the PS-interacting proteins may be isolated fiom 
cells in which the PS-interacting protein is normally highly expressed. Alternatively 
the PS-interacting protein, fusion protein, or fragment thereof, may be purified from 
cells transformed or transfected with expression vectors (e.g., baculovirus systems 
such as the pPbac and pMbac vectors (Stratagene, La Jolla, CA); yeast expression 
systems such as the pYESHIS Xpress vectors (Invitrogen, San Diego, CA); eukaryotic 
expression systems such as pcDNA3 (Invitrogen, San Diego, CA) which has constant 
constitutive expression, or LacSwitch (Stratagene, La Jolla, CA) which is inducible; 
or prokaryotic expression vectors such as pKK233-3 (Clontech, Palo Alto, CA). In 
the event that the protein or fragment integrates into the endoplasmic reticulum or 
plasma membrane of the recombinant cells (e.g., eukaryotic cells), the protein may be 
purified from the membrane fraction. Alternatively, if the protein aggregates in 
inclusion bodies within the recombinant cells (e.g., prokaryotic cells), the protein may 
be purified from whole lysed cells or from solubilized inclusion bodies. 

Purification can be achieved using standard protein purification procedures 
including, but not limited to, gel-filtration chromatography, ion-exchange 
chromatography, high-performance liquid chromatography (RP-HPLC, ion-exchange 
HPLC, size-exclusion HPLC, high-performance chromatofocusing chromatography, 
hydrophobic interaction chromatography, immunoprecipitation, or immunoaffinity 
purification. Gel electrophoresis (e.g., PAGE, SDS-PAGE) can also be used to isolate 
a protein or peptide based on its molecular weight, charge properties and 
hydrophobicity. 

A PS-interacting protein, or a fragment thereof, may also be conveniently 
purified by creating a fusion protein including the desired PS-interacting protein 
sequence fused to another peptide such as an antigenic determinant or poly-His tag 
(e.g., QlAexpress vectors, QIAGEN Corp., Chatsworth, CA), or a larger protein (e.g.. 
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GST using the pGEX-27 vector (Amrad, USA) or green fluorescent protein using the 
Green Lantern vector (GIBCO/BRL. Gaithersburg, MD). The fusion protein may be 
expressed and recovered from prokaryotic or eukaryotic cells and purified by any 
standard method based upon the fusion vector sequence. For example, the fusion 
5 protein may be purified by immunoaffinity or immunoprecipitation with an antibody 
to the non-PS-interacting protein portion of the fusion or, in the case of a poly-His tag, 
by affinity binding to a nickel column. The desired PS-interacting protein or fragment 
may then be further purified from the fusion protein by enzymatic cleavage of the 
fusion protein. Methods for preparing and using such fusion constructs for the 
10 purification of proteins are well known in the art and several kits are commercially 
available for this purpose. In light of the present disclosure, one is now enabled to 
employ such fusion constructs with the PS-interacting proteins. 

3. Antibodies to the PS-interacting Proteins 

15 The present invention also provides antibodies, and methods of making 

antibodies, which selectively bind to the PS-interacting proteins or fragments thereof. 
Of particular importance, by identifying the PS-interacting domains of the PS- 
interacting proteins, and methods of identifying mutant forms of the PS-interacting 
proteins associated with Alzheimer's Disease, the present invention provides 

20 antibodies, and methods of making antibodies, which will selectively bind to and, 

thereby, identify and/or distinguish normal and mutant (i.e., pathogenic) forms of the 
PS-interacting proteins. The antibodies of the invention have utility as laboratory 
reagents for, inter alia , immunoaffinity purification of the PS-interacting proteins, 
Western blotting to identify cells or tissues expressing the PS-interacting proteins, and 

25 immunocytochemistry or immunofluorescence techniques to establish the subcellular 
location of the proteins. In addition, as described below, the antibodies of the 
invention may be used as diagnostics tools to identify carriers of AD-related PS- 
interacting protein alleles, or as therapeutic tools to selectively bind and inhibit 
pathogenic forms of the PS-interacting proteins in vivo . 
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The antibodies of the invention may be generated using the entire PS- 
interacting proteins of the invention, or using any PS-interacting protein epitope 
which is characteristic of that protein and which substantially distinguishes it from 
other host proteins. Any method of choosing antigenic determinants known in the art 
5 may, of course, be employed. Such epitopes may be identified by comparing 
sequences of, for example, 4-10 amino acid residues from a PS-interacting protein 
sequence to computer databases of protein sequences from the relevant host. In 
addition, larger fragments (e.g., 8-20 or, preferably, 9-15 residues) including one or 
more potential epitopes may also be employed Antibodies to the PS-interacting 
10 domains (identified by the yeast two-hybrid assays described below) are expected to 
have the greatest utility both diagnostically and therapeutically. On the other hand, 
antibodies against highly conserved domains are expected to have the greatest utility 
for purification or identification of PS-interacting proteins. 

PS-interacting protein immunogen preparations may be produced from 
15 crude extracts (e.g., lysates or membrane fractions of ceils highly expressing the 

proteins), from proteins or peptides substantially purified from cells which naturally 
or recombinants express them or, for short immunogens, by chemical peptide 
synthesis. The immunogens may also be in the form of a fusion protein in which the 
non-PS-interacting protein region is chosen for its adjuvant properties. As used 
20 herein, a PS-interacting protein immunogen shall be defined as a preparation 

including a peptide comprising at least 4-8, and preferably at least 9-15 consecutive 
amino acid residues of a PS -interacting proteins, as disclosed or otherwise enabled 
herein. Sequences of fewer residues may, of course, also have utility depending upon 
the intended use and future technological developments. Therefore, any PS- 
25 interacting protein derived sequences which are employed to generate antibodies to 
the PS-interacting proteins should be regarded as PS-interacting protein immunogens. 

The antibodies of the invention may be polyclonal or monoclonal, or may 
be antibody fragments, including Fab fragments, F(ab%, and single chain antibody 
fragments. In addition, after identifying useful antibodies by the method of the 
invention, recombinant antibodies may be generated, including any of the antibody 
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fragments listed above, as well as humanized antibodies based upon non-human 
antibodies to the PS -interacting proteins. In light of the present disclosure, as well as 
the characterization of other PS-interacting proteins enabled herein, one of ordinary 
skill in the art may produce the above-described antibodies by any of a variety of 
5 standard means well known in the art. For an overview of antibody techniques, see 
Antibody Engineering: A Practical Guide. Bon-ebaek, ed., W.H. Freeman & 
Company, NY (1992), or Antibody Engineering . 2nd Ed., Borrebaek, ed M Oxford 
University Press, Oxford (1995). 

As a general matter, polyclonal antibodies may be generated by first 

10 immunizing a mouse, rabbit, goat or other suitable animal with the PS-interacting 
protein immunogen in a suitable carrier. To increase the immunogenicity of the 
preparation, the immunogen may be coupled to a carrier protein or mixed with an 
adjuvant (e.g., Freund's adjuvant). Booster injections, although not necessary are 
recommended. After an appropriate period to allow for the development of a humoral 

15 response, preferably several weeks, the animals may be bled and the sera may be 
purified to isolate the immunoglobulin component. 

Similarly, as a general matter, monoclonal anti-PS-interacting protein 
antibodies may be produced by first injecting a mouse, rabbit, goat or other suitable 
animal with a PS-interacting protein immunogen in a suitable carrier. As above, 

20 carrier proteins or adjuvants may be utilized and booster injections (e.g., bi- or tri- 
weekly over 8-10 weeks) are recommended. After allowing for development of a 
humoral response, the animals are sacrificed and their spleens are removed and 
resuspended in, for example, phosphate buffered saline (PBS). The spleen cells serve 
as a source of lymphocytes, some of which are producing antibody of the appropriate 

25 specificity. These cells are then fused with an immortalized cell line (e.g., myeloma), 
and the products of the fusion are plated into a number of tissue culture wells in the 
presence of a selective agent such as HAT. The wells are serially screened and 
replated, each time selecting cells making useful antibody. Typically, several 
screening and replating procedures are carried out until over 90% of the wells contain 

30 single clones which are positive for antibody production. Monoclonal antibodies 
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produced by such clones may be purified by standard methods such as affinity 
chromatography using Protein A Sepharose, by ion-exchange chromatography, or by 
variations and combinations of these techniques. 

The antibodies of the invention may be labelled or conjugated with other 
compounds or materials for diagnostic and/or therapeutic uses. For example, they 
may be coupled to radionuclides, fluorescent compounds, or enzymes for imaging or 
therapy, or to liposomes for the targeting of compounds contained in the liposomes to 
a specific tissue location. 



10 4. Transformed Cell Lines 

The present invention also provides for cells or cell lines, both prokaryotic 
and eukaryotic, which have been transformed or transfected with the nucleic acids of 
the present invention so as to cause clonal propagation of those nucleic acids and/or 
expression of the proteins or peptides encoded thereby. Such cells or cell lines will 
15 have utility both in the propagation and production of the nucleic acids and proteins of 
the present invention but also, as further described herein, as model systems for 
diagnostic and therapeutic assays. In particular, it is expected that cells co- 
transformed with PS-interacting protein sequences as well as presenilin sequences will 
have improved utility as models of the biochemical pathways which may be affected 
20 in AD. For example, cells co-transformed with the interacting domains of PS- 
interacting sequences and presenilins in yeast two-hybrid fusion constructs, will have 
utility in screening for compounds which either enhance or inhibit interactions 
between these domains. Similarly, for cells transformed with a heterospecific 
presenilin, co-transformation with a similarly heterospecific PS-interacting protein, or 
co-transformation and homologous recombination to introduce a similarly 
heterospecific PS-interacting domain of a PS-interacting protein (e.g., "humanizing" a 
non-human endogenous PS-interacting protein), will result in a better model system 
for studying the interactions of the presenilins and the PS-interacting proteins. Cells 
transformed with only PS -interacting sequences will, of course, have utility of their 
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own for studying the role of these proteins in the etiology of AD, and also as 
precursors for presenilin co-transformed cells. 

As used herein, the term "transformed cell" is intended to embrace any 
cell, or the descendant of any cell, into which has been introduced any of the nucleic 
5 acids of the invention, whether by transformation, transfection, infection, or other 
means. Methods of producing appropriate vectors, transforming cells with those 
vectors, and identifying transformants are well known in the art and are only briefly 
reviewed here (see, for example, Sambrook et al. (1989) Molecular Cloning: A 
Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory Press, Cold Spring 

10 Harbor, New York). 

Prokaryotic cells useful for producing the transformed cells of the 
invention include members of the bacterial genera Escherichia (e.g., E. coin . 
Pseudomonas (e.g., P. aeruginosa) , and Bacillus (e.g., B. subtillus . g. 
stearothermophilus ). as well as many others well known and frequently used in the 

15 art. Prokaryotic cells are particularly useful for the production of large quantities of 
the proteins or peptides of the invention (e.g., normal or mutant PS-interacting 
proteins, fragments of the PS-interacting proteins, fusion proteins of the PS- 
interacting proteins). Bacterial cells (e.g., E. coin may be used with a variety of 
expression vector systems including, for example, plasmids with the T7 RNA 

20 polymerase/promoter system, bacteriophage X regulatory sequences, or Ml 3 Phage 
mGPI-2. Bacterial hosts may also be transformed with fusion protein vectors which 
create, for example, lacZ, trpE, maltose-binding protein, poly-His tags, or glutathione- 
s-transferase fusion proteins. All of these, as well as many other prokaryotic 
expression systems, are well known in the art and widely available commercially 

25 (e.g., pGEX-27 (Amrad, USA) for GST fusions). 

Eukaryotic cells and cell lines useful for producing the transformed cells of 
the invention include mammalian cells and cell lines (e.g., PC12, COS, CHO, 
fibroblasts, myelomas, neuroblastomas, hybridomas, human embryonic kidney 293, 
oocytes, embryonic stem cells), insect cells lines (e.g., using baculovirus vectors such 

30 as pPbac or pMbac (Stratagene, La Jolla, CA)), yeast (e.g., using yeast expression 
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vectors such as pYESHIS (Invitrogen, CA)), and fungi. Eukaryotic cells are 
particularly useful for embodiments in which it is necessary that the PS-interacting 
proteins, or functional fragments thereof, perform the functions and/or undergo the 
intracellular interactions associated with either the normal or mutant proteins. Thus, 
for example, transformed eukaryotic cells are preferred for use as models of PS- 
interacting protein function or interaction, and assays for screening candidate 
therapeutics preferably employ transformed eukaryotic cells. 

To accomplish expression in eukaryotic cells, a wide variety of vectors 
have been developed and are commercially available which allow inducible (e.g., 
LacSwitch expression vectors, Stratagene, La Jolla, CA) or cognate (e.g., pcDNA3 
vectors, Invitrogen, Chatsworth, CA) expression of PS-interacting protein nucleotide 
sequences under the regulation of an artificial promoter element. Such promoter 
elements are often derived from CMV or SV40 viral genes, although other strong 
promoter elements which are active in eukaryotic cells can also be employed to induce 
transcription of PS-interacting protein nucleotide sequences. Typically, these vectors 
also contain an artificial polyadenylation sequence and 3' UTR which can also be 
derived from exogenous viral gene sequences or from other eukaryotic genes. 
Furthermore, in some constructs, artificial, non-coding, spliceable introns and exons 
are included in the vector to enhance expression of the nucleotide sequence of interest. 
These expression systems are commonly available from commercial sources and are 
typified by vectors such as pcDNA3 and pZeoSV (Invitrogen, San Diego, CA). 
Innumerable commercially-available as well as custom-designed expression vectors 
are available from commercial sources to allow expression of any desired PS- 
interacting protein transcript in more or less any desired eel! type, either constitutively 
or after exposure to a certain exogenous stimulus (e.g., withdrawal of tetracycline or 
exposure to IPTG). 

Vectors may be introduced into the recipient or "host" cells by various 
methods well known in the art including, but not limited to, calcium phosphate 
transfection, strontium phosphate transfection, DEAE dextran transfection, 
electroporation, lipofection (e.g., Dosper Liposomal transfection reaeent, Boehringer 
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Mannheim, Germany), microinjection, ballistic insertion on micro-beads, protoplast 
fusion or, for viral or phage vectors, by infection with the recombinant virus or phage. 

5. Transgenic Animal Models 
5 The present invention also provides for the production of transgenic non- 

human animal models in which mutant or wild type PS-interacting protein sequences 
are expressed, or in which the PS-interacting protein genes have been inactivated 
(e.g., "knock-out" deletions), for the study of Alzheimer's Disease, for the screening of 
candidate pharmaceutical compounds, for the creation of explanted mammalian CNS 

10 cell cultures (e.g., neuronal, glial, organotypic or mixed cell cultures), and for the 
evaluation of potential therapeutic interventions. Prior to the present invention, a 
partial animal model for Alzheimer's Disease existed via the insertion and over- 
expression of a mutant form of the human amyloid precursor protein gene as a 
minigene under the regulation of the platelet-derived growth factor P receptor 

15 promoter element (Games et aL, 1995). This mutant (PAPP 7I7 Val-*Ile) causes the 
appearance of synaptic pathology and amyloid P peptide deposition in the brain of 
transgenic animals bearing this transgene in high copy number. These changes in the 
brain of the transgenic animal are very similar to that seen in human AD (Games et 
al., 1995). It is, however, as yet unclear whether these animals become demented, but 

20 there is general consensus that it is now possible to recreate at least some aspects of 
AD in mice. In addition, transgenic animal models in which the presenilin genes are 
genetically engineered are disclosed in PCT Publication WO96/34099. These 
transgenic animal models have been shown to have altered Ap production and altered 
hippocampus-dependent memory function. 

25 Animal species suitable for use in the animal models of the present 

invention include, but are not limited to, rats, mice, hamsters, guinea pigs, rabbits, 
dogs, cats, goats, sheep, pigs, and non-human primates (e.g., Rhesus monkeys, 
chimpanzees). For initial studies, transgenic rodents (e.g., mice) may be preferred due 
to their relative ease of maintenance and shorter life spans. However, transgenic yeast 

30 or invertebrates (e.g., nematodes, insects) may be preferred for some studies because 



SUBSTITUTE SHEET (RULE 26) 



WO 97/27296 PCT/CA97/00051 



-39- 



they will allow for even more rapid and inexpensive screening. For example, 
invertebrates bearing mutant PS-interacting protein homologues (or mammalian PS- 
mteracting protein transgenes) which cause a rapidly occurring and easily scored 
phenotype (e.g., abnormal vulva or eye development after several days) can be used as 
5 screens for drugs which block the effect of the mutant gene. Such invertebrates may 
prove far more rapid and efficient for mass screenings than larger vertebrate animals. 
Once lead compounds are found through such screens, they may be tested in higher 
animals such a rodents. Ultimately, transgenic non-human primates may be preferred 
for longer term studies due to their greater similarity to humans and their higher 
10 cognitive abilities. 

Using the nucleic acids disclosed and otherwise enabled herein, there are 
now several available approaches for the creation of a transgenic animal model for 
Alzheimer's Disease. Thus, the enabled animal models include: (1) Animals in which 
sequences encoding at least a functional domain of a normal human PS-interacting 
15 protein gene have been recombinantly introduced into the genome of the animal as an 
additional gene, under the regulation of either an exogenous or an endogenous 
promoter element, and as either a minigene or a large genomic fragment; in which 
sequences encoding at least a functional domain of a normal human PS-interacting 
protein gene have been recombinantly substituted for one or both copies of the 
20 animal's homologous PS-interacting protein gene by homologous recombination or 
gene targeting; and/or in which one or both copies of one of the animal's homologous 
PS-interacting protein genes have been recombinantly "humanized" by the partial 
substitution of sequences encoding the human homologue by homologous 
recombination or gene targeting. These animals are useful for evaluating the effects 
25 of the transgenic procedures, and the effects of the introduction or substitution of a 
human or humanized PS-interacting protein gene. (2) Animals in which sequences 
encoding at least a functional domain of a mutant (i.e., pathogenic) human PS- 
interacting protein gene have been recombinantly introduced into the genome of the 
animal as an additional gene, under the regulation of either an exogenous or an 
endogenous promoter element, and as either a minigene or a large genomic fragment; 
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in which sequences encoding at least a functional domain of a mutant human PS- 
interacting protein gene have been recombinantly substituted for one or both copies of 
the animal's homologous PS-interacting protein gene by homologous recombination 
or gene targeting; and/or in which one or both copies of one of the animal's 
5 homologous PS-interacting protein genes have been recombinantly "humanized" by 
the partial substitution of sequences encoding a mutant human homologue by 
homologous recombination or gene targeting. These animals are useful as models 
which will display some or all of the characteristics, whether at the biochemical, 
physiological and/or behavioral level, of humans carrying one or more alleles which 

10 are pathogenic of Alzheimer's Disease or other diseases associated with mutations in 
the PS-interacting protein genes. (3) Animals in which sequences encoding at least a 
functional domain of a mutant version of one of that animal's PS-interacting protein 
genes (bearing, for example, a specific mutation corresponding to, or similar to, one 
of the pathogenic mutations of the human PS-interacting proteins) have been 

15 recombinantly introduced into the genome of the animal as an additional gene, under 
the regulation of either an exogenous or an endogenous promoter element, and as 
either a minigene or a large genomic fragment; and/or in which sequences encoding at 
least a functional domain of a mutant version of one of that animal's PS-interacting 
protein genes (bearing, for example, a specific mutation corresponding to, or similar 

20 to, one of the pathogenic mutations of the human PS-interacting proteins) have been 
recombinantly substituted for one or both copies of the animal's homologous PS- 
interacting protein gene by homologous recombination or gene targeting. These 
animals are also useful as models which will display some or all of the characteristics, 
whether at the biochemical, physiological and/or behavioral level, of humans carrying 

25 one or more alleles which are pathogenic of Alzheimer's Disease. (4) "Knock-out" 
animals in which one or both copies of one of the animal's PS-interacting protein 
genes have been partially or completely deleted by homologous recombination or 
gene targeting, or have been inactivated by the insertion or substitution by 
homologous recombination or gene targeting of exogenous sequences (e.g., stop 

30 codons, lox p sites). Such animals are useful models to study the effects which loss of 
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PS-interacting protein gene expression may have, to evaluate whether loss of function 
is preferable to continued expression of mutant forms, and to examine whether other 
genes can be recruited to replace a mutant PS-interacting protein or to intervene with 
the effects of other genes (e.g., PS1, PS2, APP or ApoE) causing AD as a treatment 
5 for AD or other disorders. For example, a normal PS-interacting protein gene may be 
necessary for the action of mutant presenilin or APP genes to actually be expressed as 
AD and, therefore, transgenic PS-interacting protein animal models may be of use in 
elucidating such multigenic interactions. 

In addition to transgenic animal models in which the expression of one or 
10 more of the PS-interacting proteins is altered, the present invention also provides for 
the production of transgenic animal models in which the expression of one or more of 
the presenilins, APP, or ApoE is altered. The nucleic acids encoding the presenilins, 
APP, and ApoE are known in the art, a methods for producing transgenic animals with 
these sequences are also known (see, e.g., PCT Publication WO96/34099; Games et 
15 al., 1 995). Indeed, because non-human animals may differ from humans not only in 
their PS-interacting protein sequences, but also in the sequences of their presenilin, 
APP and/or ApoE homologues, it is particularly contemplated that transgenics may be 
produced which bear recombinant normal or mutant human sequences for at least one 
presenilin, APP and/or ApoE gene in addition to recombinant sequences for one or 
more PS-interacting proteins. Such co-transformed animal models would possess 
more elements of the human molecular biology and, therefore, are expected to be 
better models of human disorders. Thus, in accordance with the present invention, 
transgenic animal models may be produced bearing normal or mutant sequences for 
one or more PS-interacting proteins, or interacting domains of these proteins. These 
animals will have utility in that they can be crossed with animals bearing a variety of 
normal or mutant presenilin, APP or ApoE sequences to produce co-transformed 
animal models. Furthermore, as detailed below, it is expected that mutations in the 
PS-interacting genes, like mutations in the presenilins themselves, may be causative 
of Alzheimer's Disease and/or other disorders as well (e.g., other cognitive, 
intellectual, neurological or psychological disorders such as cerebral hemorrhage, 
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schizophrenia, depression, mental retardation and epilepsy). Therefore, transgenic 
animal models bearing normal or mutant sequences corresponding to the PS- 
interacting proteins, absent transformation with any presenilin, APP or ApoE 
sequences, will have utility of their own in the study of such disorders. 
5 As detailed below, preferred choices for transgenic animal models 

transformed with PS-interacting proteins, or domains of PS-interacting proteins, 
include those transformed with normal or mutant sequences corresponding to the 
clones identified and described in Example 1 and disclosed in SEQ ID NOs: 1-12. 
These clones, which interact with normal or mutant PS1 TM6->7 loop domains, were 

10 identified according to the methods described in Example 1, below, and PCT 

Publication WO96/34099. These clones, longer nucleic acid sequences comprising 
these clones, and other clones identified according to this and other methods of the 
invention (e.g., allelic and splice variants or heterospecific homologues of these 
clones) may all be employed in accordance with the present invention to produce 

15 animal models which, with or without co-transformation with presenilin, APP and/or 
ApoE sequences, will have utility in the study of Alzheimer's Disease and/or other 
cognitive, intellectual, neurological or psychological disorders. 

Thus, using the nucleic acids disclosed and otherwise enabled herein, one 
of ordinary skill in the art may now produce any of the following types of transgenic 

20 animal models with altered PS-interacting protein expression: (1) Animals in which 
sequences encoding at least a functional domain of a normal human PS-interacting 
protein gene have been recombinantly introduced into the genome of the animal as an 
additional gene, under the regulation of either an exogenous or an endogenous 
promoter element, and as either a minigene or a large genomic fragment; in which 

25 sequences encoding at least a functional domain of a normal human PS-interacting 
protein gene have been recombinantly substituted for one or both copies of the 
animal's homologous PS-interacting protein gene by homologous recombination or 
gene targeting; and/or in which one or both copies of one of the animal's homologous 
PS-interacting protein genes have been recombinantly "humanized" by the partial 

30 substitution of sequences encoding the human homologue by homologous 
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recombination or gene targeting. These animals are particularly useful for providing 
transgenic models which express human PS-interacting proteins as well as human 
presenilin proteins. They are also useful in evaluating the effects of the transgenic 
procedures, and the effects of the introduction or substitution of a human or 
5 humanized PS-interacting protein gene. (2) Animals in which sequences encoding at 
least a functional domain of a mutant (i.e., pathogenic) human PS-interacting protein 
gene have been recombinantly introduced into the genome of the animal as an 
additional gene, under the regulation of either an exogenous or an endogenous 
promoter element, and as either a minigene or a large genomic fragment; in which 
10 sequences encoding at least a functional domain of a mutant human PS -interacting 
protein gene have been recombinantly substituted for one or both copies of the 
animal's homologous PS-interacting protein gene by homologous recombination or 
gene targeting; and/or in which one or both copies of one of the animal's homologous 
PS-interacting protein genes have been recombinantly "humanized" by the partial 
15 substitution of sequences encoding a mutant human homologue by homologous 
recombination or gene targeting. These animals are useful as models which will 
display some or all of the characteristics, whether at the biochemical, physiological 
and/or behavioral level, of humans carrying one or more alleles which are pathogenic 
of Alzheimer's Disease or other diseases associated with mutations in these PS- 
20 interacting genes. (3) Animals in which sequences encoding at least a functional 
domain of a mutant version of one of that animal's PS-interacting protein genes 
(bearing, for example, a specific mutation corresponding to, or similar to, one of the 
pathogenic mutations of the human PS-interacting proteins) have been recombinantly 
introduced into the genome of the animal as an additional gene, under the regulation 
25 of either an exogenous or an endogenous promoter element, and as either a minigene 
or a large genomic fragment; and/or in which sequences encoding at least a functional 
domain of a mutant version of one of that animal's PS-interacting protein genes 
(bearing, for example, a specific mutation corresponding to, or similar to, one of the 
pathogenic mutations of the humans PS-interacting proteins) have been recombinantly 
30 substituted for one or both copies of the animal's homologous PS-interacting protein 
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gene by homologous recombination or gene targeting. These animals are also useful 
as models which will display some or all of the characteristics, whether at the 
biochemical, physiological and/or behavioral level, of humans carrying one or more 
alleles which are pathogenic of Alzheimer's Disease. (4) "Knock-out" animals in 
5 which one or both copies of one of the animal's PS -interacting protein genes have 
been partially or completely deleted by homologous recombination or gene targeting, 
or have been inactivated by the insertion or substitution by homologous 
recombination or gene targeting of exogenous sequences (e.g., stop codons, lox p 
sites). Such animals are useful models to study the effects which loss of PS- 
interacting protein gene expression may have, to evaluate whether loss of function is 
preferable to continued expression, and to examine whether other genes can be 
recruited to replace a mutant PS-interacting protein or to intervene with the effects of 
other genes (e.g., APP or ApoE) causing AD as a treatment for AD or other disorders. 
For example, a normal PS-interacting protein may be necessary for the action of 
mutant PS1, PS2 or APP genes to actually be expressed as AD and, therefore, 
transgenic PS-interacting protein animal models may be of use in elucidating such 
multigenic interactions. 

In some preferred embodiments, transgenic animal models are produced in 
which just the PS-interacting domains of the PS-interacting proteins are introduced 
into the genome of the animal by homologous recombination. Thus, for example, 
preferred embodiments include transgenic animals in which the PS-interacting 
domains of PS-interacting proteins are "humanized" by homologous recombination 
with sequences from human PS-interacting proteins. These animals may then be bred 
with transgenics in which normal or mutant presenilin sequences have been 
introduced. The progeny of these animals, having both human presenilin and human 
PS-interacting protein sequences, will provide improved animal models for 
Alzheimer's Disease. 

To create an animal model (e.g., a transgenic mouse), a normal or mutant 
PS-interacting gene (e.g., normal or mutant S5a, GT24, p0071, Rabl 1 , etc.), or a 
normal or mutant version of a recombinant nucleic acid encoding at least a functional 



SUBSTITUTE SHEET (RULE 26) 



WO 97/27296 



PCT/CA97/00051 



-45- 



domain of a PS-interacting gene (e.g., the PS-interacting domains obtained in the 
yeast two-hybrid system), can be inserted into a germ line or stem cell using standard 
techniques of oocyte microinjection, or transfection or microinjection into embryonic 
stem cells. Animals produced by these or similar processes are referred to as 
5 transgenic. Similarly, if it is desired to inactivate or replace an endogenous presenilin 
or PS-interacting protein gene, homologous recombination using embryonic stem 
cells may be employed. Animals produced by these or similar processes are referred 
to as "knock-out" (inactivation) or "knock-in" (replacement) models. 

For oocyte injection, one or more copies of the recombinant DNA 
10 constructs of the present invention may be inserted into the pronucleus of a just- 
fertilized oocyte. This oocyte is then reimplanted into a pseudo-pregnant foster 
mother. The livebom animals are screened for integrants using analysis of DNA (e.g., 
from the tail veins of offspring mice) for the presence of the inserted recombinant 
transgene sequences. The transgene may be either a complete genomic sequence 
15 injected as a YAC, BAC, PAC or other chromosome DNA fragment, a cDNA with 

either the natural promoter or a heterologous promoter, or a minigene containing all of 
the coding region and other elements found to be necessary for optimum expression. 

Retroviral infection of early embryos can also be done to insert the 
recombinant DNA constructs of the invention. In this method, the transgene (e.g., a 
20 normal or mutant S5a, GT24, P 007 1 , Rab 1 1 , etc., sequence) is inserted into a 

retroviral vector which is used to infect embryos (e.g., mouse or non-human primate 
embryos) directly during the early stages of development to generate chimeras, some 
of which will lead to germline transmission. 

Homologous recombination using stem cells allows for the screening of 
25 gene transfer cells to identify the rare homologous recombination events. Once 
identified, these can be used to generate chimeras by injection of blastocysts, and a 
proportion of the resulting animals will show germline transmission from the 
recombinant line. This methodology is especially useful if inactivation of a gene is 
desired. For example, inactivation of the S5a gene in mice may be accomplished by 
30 designing a DNA fragment which contains sequences from an S5a coding region 
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flanking a selectable marker. Homologous recombination leads to the insertion of the 
marker sequences in the middle of the coding region, causing inactivation of the S5a 
gene and/or deletion of internal sequences. DNA analysis of individual clones can 
then be used to recognize the homologous recombination events. 
5 The techniques of generating transgenic animals, as well as the techniques 

for homologous recombination or gene targeting, are now widely accepted and 
practiced. A laboratory manual on the manipulation of the mouse embryo, for 
example, is available detailing standard laboratory techniques for the production of 
transgenic mice (Hogan et al., 1986). To create a transgene, the target sequence of 

10 interest (e.g., normal or mutant presenilin sequences, normal or mutant PS-interacting 
protein sequences) are typically ligated into a cloning site located downstream of 
some promoter element which will regulate the expression of RNA from the 
sequence. Downstream of the coding sequence, there is typically an artificial 
polyadenylation sequence. In the transgenic models that have been used to 

15 successfully create animals which mimic aspects of inherited human 

neurodegenerative diseases, the most successful promoter elements have been the 
platelet-derived growth factor receptor p gene subunit promoter and the hamster prion 
protein gene promoter, although other promoter elements which direct expression in 
central nervous system cells would also be useful. An alternate approach to creating a 

20 transgene is to use an endogenous presenilin or PS-interacting protein gene promoter 
and regulatory sequences to drive expression of the transgene. Finally, it is possible 
to create transgenes using large genomic DNA fragments such as YACs which 
contain the entire desired gene as well as its appropriate regulatory sequences. Such 
constructs have been successfully used to drive human APP expression in transgenic 

25 mice (Lamb et al., 1993). 

Animal models can also be created by targeting the endogenous presenilin 
or PS-interacting protein gene in order to alter the endogenous sequence by 
homologous recombination. These targeting events can have the effect of removing 
endogenous sequence (knock-out) or altering the endogenous sequence to create an 

30 amino acid change associated with human disease or an otherwise abnormal sequence 
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(e.g., a sequence which is more like the human sequence than the original animal 
sequence) (knock-in animal models). A large number of vectors are available to 
accomplish this and appropriate sources of genomic DNA for mouse and other animal 
genomes to be targeted are commercially available from companies such as 
5 GenomeSystems Inc. (St. Louis, Missouri, USA). The typical feature of these 
targeting vector constructs is that 2 to 4 kb of genomic DNA is ligated 5' to a 
selectable marker (e.g., a bacterial neomycin resistance gene under its own promoter 
element termed a "neomycin cassette"). A second DNA fragment from the gene of 
interest is then ligated downstream of the neomycin cassette but upstream of a second 
10 selectable marker (e.g., thymidine kinase). The DNA fragments are chosen such that 
mutant sequences can be introduced into the germ line of the targeted animal by 
homologous replacement of the endogenous sequences by either one of the sequences 
included in the vector. Alternatively, the sequences can be chosen to cause deletion of 
sequences that would normally reside between the left and right arms of the vector 
15 surrounding the neomycin cassette. The former is known as a knock-in, the latter is 
known as a knock-out. Again, innumerable model systems have been created, 
particularly for targeted knock-outs of genes including those relevant to 
neurodegenerative diseases (e.g., targeted deletions of the murine APP gene by Zheng 
et al., 1995; targeted deletion of the murine prion gene associated with adult onset 
20 human CNS degeneration by Bueler et al., 1 996). 

Finally, equivalents of transgenic animals, including animals with mutated 
or inactivated presenilin genes, or mutated or inactivated PS-interacting protein genes, 
may be produced using chemical or X-ray mutagenesis of gametes, followed by 
fertilization. Using the isolated nucleic acids disclosed or otherwise enabled herein, 
one of ordinary skill may more rapidly screen the resulting offspring by, for example, 
direct sequencing RFLP, PCR, or hybridization analysis to detect mutants, or 
Southern blotting to demonstrate loss of one allele by dosage. 

6 - Assay? for Drugs Which Affect PS-Tnt^rH n e Protein Evp r^;™ 
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In another series of embodiments, the present invention provides assays for 
identifying small molecules or other compounds which are capable of inducing or 
inhibiting the expression of the PS-interacting genes and proteins (e.g., S5a or GT24). 
The assays may be performed in vitro using non-transformed cells, immortalized cell 
5 lines, or recombinant cell lines, or in vivo using the transgenic animal models enabled 
herein. 

In particular, the assays may detect the presence of increased or decreased 
expression of S5a, GT24, p0071, Rab 1 1, or other PS-interacting genes or proteins on 
the basis of increased or decreased mRNA expression (using, e.g., the nucleic acid 

10 probes disclosed and enabled herein), increased or decreased levels of PS-interacting 
proteins (using, e.g., the anti-PS-interacting protein antibodies disclosed and enabled 
herein), or increased or decreased levels of expression of a marker gene (e.g., P- 
galactosidase or luciferase) operably joined to a PS-interacting protein 5' regulatory 
region in a recombinant construct. 

15 Thus, for example, one may culture cells known to express a particular PS- 

interacting protein and add to the culture medium one or more test compounds. After 
allowing a sufficient period of time (e.g., 0-72 hours) for the compound to induce or 
inhibit the expression of the PS-interacting protein, any change in levels of expression 
from an established baseline may be detected using any of the techniques described 

20 above and well known in the art. In particularly preferred embodiments, the cells are 
from an immortalized cell line such as a human neuroblastoma, glioblastoma or a 
hybridoma cell line. Using the nucleic acid probes and /or antibodies disclosed and 
enabled herein, detection of changes in the expression of a PS-interacting protein, and 
thus identification of the compound as an inducer or repressor of PS-interacting 

25 protein expression, requires only routine experimentation. 

In particularly preferred embodiments, a recombinant assay is employed in 
which a reporter gene such a P-galactosidase, green fluorescent protein , alkaline 
phosphatase, or luciferase is operably joined to the 5' regulatory regions of a PS- 
interacting protein gene. Preferred vectors include the Green Lantern 1 vector 

30 (GIBCO/BRL, Gaithersburg, MD) and the Great EScAPe pSEAP vector (Clontech, 
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Palo Alto). The PS-interacting protein regulatory regions may be easily isolated and 
cloned by one of ordinary skill in the art in light of the present disclosure of coding 
regions from these genes. The reporter gene and regulatory regions are joined in- 
frame (or in each of the three possible reading frames) so that transcription and 
5 translation of the reporter gene may proceed under the control of the PS-interacting 
protein regulatory elements. The recombinant construct may then be introduced into 
any appropriate cell type, although mammalian cells are preferred, and human cells 
are most preferred. The transformed cells may be grown in culture and, after 
establishing the baseline level of expression of the reporter gene, test compounds may 
10 be added to the medium. The ease of detection of the expression of the reporter gene 
provides for a rapid, high through-put assay for the identification of inducers and 
repressors of the PS-interacting protein gene. 

Compounds identified by this method will have potential utility in 
modifying the expression of the PS-interacting protein genes in vivo . These 
15 compounds may be further tested in the animal models disclosed and enabled herein 
to identify those compounds having the most potent in vivo effects. In addition, as 
described herein with respect to small molecules having binding activity for PS- 
interacting proteins, these molecules may serve as "lead compounds" for the further 
development of pharmaceuticals by, for example, subjecting the compounds to 
20 sequential modifications, molecular modeling, and other routine procedures employed 
in rational drug design. 

7 - Identification of Compounds with PS-T nteracting Protein Binding Cap acity 

In light of the present disclosure, one of ordinary skill in the art is enabled 

25 to practice new screening methodologies which will be useful in the identification of 
proteins and other compounds which bind to, or otherwise directly interact with, the 
PS-interacting proteins. The proteins and compounds will include endogenous 
cellular components, aside from the presenilins, which interact with the PS-interacting 
proteins in vivo and which, therefore, provide new targets for pharmaceutical and 

10 therapeutic interventions, as well as recombinant, synthetic and otherwise exogenous 
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compounds which may have PS-interacting protein binding capacity and, therefore, 
may be candidates for pharmaceutical agents. Thus, in one series of embodiments, 
cell lysates or tissue homogenates (e.g., human brain homogenates, lymphocyte 
lysates) may be screened for proteins or other compounds which bind to one of the 
5 normal or mutant PS-interacting proteins. Alternatively, any of a variety of 

exogenous compounds, both naturally occurring and/or synthetic (e.g., libraries of 
small molecules or peptides), may be screened for PS-interacting protein binding 
capacity. Small molecules are particular preferred in this context because they are 
more readily absorbed after oral administration, have fewer potential antigenic 
10 determinants, and/or are more likely to cross the blood brain barrier than larger 

molecules such as nucleic acids or proteins. The methods of the present invention are 
particularly useful in that they may be used to identify molecules which selectively or 
preferentially bind to a mutant form of a PS-interacting protein (rather than a normal 
form) and, therefore, may have particular utility in treating cases of AD which arise 
15 from mutations in the PS -interacting proteins. 

Once identified by the methods described above, the candidate compounds 
may then be produced in quantities sufficient for pharmaceutical administration or 
testing (e.g., ng or mg or greater quantities), and formulated in a pharmaceutical^ 
acceptable carrier (see, e.g., Remington's Pharmaceutical Sciences Gennaro, A., ed., 
20 Mack Pub., 1990). These candidate compounds may then be administered to the 

transformed cells of the invention, to the transgenic animal models of the invention, to 
cell lines derived from the animal models or from human patients, or to Alzheimer's 
patients. The animal models described and enabled herein are of particular utility in 
further testing candidate compounds which bind to normal or mutant PS-interacting 
25 proteins for their therapeutic efficacy. 

In addition, once identified by the methods described above, the candidate 
compounds may also serve as "lead compounds" in the design and development of 
new pharmaceuticals. For example, as in well known in the art, sequential 
modification of small molecules (e.g., amino acid residue replacement with peptides; 
30 functional group replacement with peptide or non-peptide compounds) is a standard 
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approach in the pharmaceutical industry for the development of new pharmaceuticals. 
Such development generally proceeds from a "lead compound" which is shown to 
have at least some of the activity (e.g., PS-interacting protein binding or blocking 
ability) of the desired pharmaceutical. In particular, when one or more compounds 
5 having at least some activity of interest (e.g., modulation of PS-interacting protein 
activity) are identified, structural comparison of the molecules can greatly inform the 
skilled practitioner by suggesting portions of the lead compounds which should be 
conserved and portions which may be varied in the design of new candidate 
compounds. Thus, the present invention also provides a means of identifying lead 
10 compounds which may be sequentially modified to produce new candidate 

compounds for use in the treatment of Alzheimer's Disease. These new compounds 
then may be tested both for binding to PS-interacting proteins and/or blocking PS- 
interacting protein activity, and for therapeutic efficacy (e.g., in the animal models 
described herein). This procedure may be iterated until compounds having the desired 
15 therapeutic activity and/or efficacy are identified. 

In each of the present series of embodiments, an assay is conducted to 
detect binding between a "PS-interacting protein component" and some other moiety. 
Of particular utility will be sequential assays in which compounds are tested for the 
ability to bind to only normal or only mutant forms of the PS-interacting domains of 
20 PS-interacting proteins in the binding assays. Such compounds are expected to have 
the greatest therapeutic utilities, as described more fully below. The "PS-interacting 
protein component" in these assays may be a complete normal or mutant form of a 
PS-interacting protein (e.g., S5a, GT24, p0071, Rab 1 1, etc.) but need not be. Rather, 
particular functional domains of the PS-interacting proteins, particularly the PS- 
25 interacting domains as described above, may be employed either as separate 
molecules or as part of a fusion protein. For example, to isolate proteins or 
compounds that interact with these functional domains, screening may be carried out 
using fusion constructs and/or synthetic peptides corresponding to these regions. 
Thus, for S5a, GST-fusion peptides may be made including sequences corresponding 
30 approximately to amino acids 70-377 of SEQ ID NO: 2 (included in clones Y2H29 
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and Y2H31, see Example 1), approximately to amino acids 206-377 of SEQ ID NO: 2 
(which includes protein-protein interaction motifs, see Ferrell et al., 1996), or to any 
other S5a domain of interest. Similarly, for GT24, GST- or other fusion peptides may 
be produced including sequences corresponding approximately to amino acids 440- 
5 815 of SEQ ID NO: 4 (including part of the armadillo repeat segment). Obviously, 
various combinations of fusion proteins and PS-interacting protein functional domains 
are possible and these are merely examples. In addition, the functional domains may 
be altered so as to aid in the assay by, for example, introducing into the fiinctional 
domain a reactive group or amino acid residue (e.g., cysteine) which will facilitate 

10 immobilization of the domain on a substrate (e.g., using sulfhydryl reactions). Thus, 
for example, the PS-interacting domain of S5a may be synthesized containing an 
additional C-terminal cysteine residue to facilitate immobilization of the domain. 
Such peptides may be used to create an affinity substrate for affinity chromatography 
(Suifo-link; Pierce) to isolate binding proteins for microsequencing. Similarly, other 

15 functional domain or antigenic fragments may be created with modified residues (see, 
e.g., Example 4). 

The proteins or other compounds identified by these methods may be 
purified and characterized by any of the standard methods known in the art. Proteins 
may, for example, be purified and separated using electrophoretic (e.g., SDS-PAGE, 

20 2D PAGE) or chromatographic (e.g., HPLC) techniques and may then be 

microsequenced. For proteins with a blocked N-terminus, cleavage (e.g., by CNBr 
and/or trypsin) of the particular binding protein is used to release peptide fragments. 
Further purification/characterization by HPLC and microsequencing and/or mass 
spectrometry by conventional methods provides internal sequence data on such 

25 blocked proteins. For non-protein compounds, standard organic chemical analysis 
techniques (e.g., IR, NMR and mass spectrometry; functional group analysis; X-ray 
crystallography) may be employed to determine their structure and identity. 

Methods for screening cellular lysates, tissue homogenates, or small 
molecule libraries for candidate PS-interaction protein-binding molecules are well 

30 known in the art and, in light of the present disclosure, may now be employed to 
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identify compounds which bind to normal or mutant PS-interacting protein 
components or which modulate PS-interacting protein activity as defined by non- 
specific measures (e.g., changes in intracellular Ca 2 \ GTP/GDP ratio) or by specific 
measures (e.g., changes in Ap peptide production or changes in the expression of 
5 other downstream genes which can be monitored by differential display, 2D gel 

electrophoresis, differential hybridization, or SAGE methods). The preferred methods 
involve variations on the following techniques: (1) direct extraction by affinity 
chromatography; (2) co-isolation of PS-interacting protein components and bound 
proteins or other compounds by immunoprecipitation; (3) the Biomolecular 
10 Interaction Assay (BIAcore); and (4) the yeast two-hybrid systems. These and others 
are discussed separately below. 

A. Affinity Chromatography 

In light of the present disclosure, a variety of affinity binding techniques 
well known in the art may be employed to isolate proteins or other compounds which 
15 bind to the PS-interacting protein disclosed or otherwise enabled herein. In general, a 
PS-interacting protein component may be immobilized on a substrate (e.g., a column 
or filter) and a solution including the test compound(s) is contacted with the PS- 
interacting protein, fusion or fragment under conditions which are permissive for 
binding. The substrate is then washed with a solution to remove unbound or weakly 
20 bound molecules. A second wash may then elute those compounds which strongly 
bound to the immobilized normal or mutant PS-interacting protein component. 
Alternatively, the test compounds may be immobilized and a solution containing one 
or more PS-interacting protein components may be contacted with the column, filter 
or other substrate. The ability of the PS-interacting protein component to bind to the 
25 test compounds may be determined as above or a labeled form of the PS-interacting 
protein component (e.g., a radio-labeled or chemiluminescent functional domain) may 
be used to more rapidly assess binding to the substrate-immobilized compound(s). 
B. Co-Immunoprecip itatinn 

Another well characterized technique for the isolation of PS-interacting 
30 protein components and their associated proteins or other compounds is direct 
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immunoprecipitation with antibodies. This procedure has been successfully used, for 
example, to isolate many of the synaptic vesicle associated proteins (Phizicky and 
Fields, 1994). Thus, either normal or mutant, free or membrane-bound PS-interacting 
protein components may be mixed in a solution with the candidate compound(s) 
5 under conditions which are permissive for binding, and the PS-interacting protein 
component may be immunoprecipitated. Proteins or other compounds which co- 
immunoprecipitate with the PS-interacting protein component may then be identified 
by standard techniques as described above. General techniques for 
immunoprecipitation may be found in, for example, Harlow and Lane, (1988) 
10 Antibodies: A Laboratory Manual , Cold Spring Harbor Press, Cold Spring Harbor, 
NY. 

The antibodies employed in this assay, as described and enabled herein, 
may be polyclonal or monoclonal, and include the various antibody fragments (e.g., 
Fab, F(ab%) as well as single chain antibodies, and the like. 

15 C. The Biomolecular Interaction Assay 

Another useful method for the detection and isolation of binding proteins 
is the Biomolecular Interaction Assay or "BIAcore" system developed by Pharmacia 
Biosensor and described in the manufacturer's protocol (LKB Pharmacia, Sweden). In 
light of the present disclosure, one of ordinary skill in the art is now enabled to 

20 employ this system, or a substantial equivalent, to identify proteins or other 

compounds having PS-interacting protein binding capacity. The BIAcore system uses 
an affinity purified anti-GST antibody to immobilize GST- fusion proteins onto a 
sensor chip. Obviously, other fusion proteins and corresponding antibodies may be 
substituted. The sensor utilizes surface plasmon resonance which is an optical 

25 phenomenon that detects changes in refractive indices. A homogenate of a tissue of 
interest is passed over the immobilized fusion protein and protein-protein interactions 
are registered as changes in the refractive index. This system can be used to 
determine the kinetics of binding and to assess whether any observed binding is of 
physiological relevance. 
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D. The Yeast Two-Hybrid Sy ct/» m 

The yeast "two-hybrid" system takes advantage of transcriptional factors 
that are composed of two physically separable, functional domains (Phizicky and 
Fields, 1994). The most commonly used is the yeast GAL4 transcriptional activator 
consisting of a DNA binding domain and a transcriptional activation domain. Two 
different cloning vectors are used to generate separate fusions of the GAL4 domains 
to genes encoding potential binding proteins. The fusion proteins are co-expressed, 
targeted to the nucleus and, if interactions occur, activation of a reporter gene (e.g., 
lacZ) produces a detectable phenotype. For example, the Clontech Matchmaker 
System-2 may be used with the Clontech brain cDNA GAL4 activation domain fusion 
library with PS-interacting protein-GAL4 binding domain fusion clones (Clontech, 
Palo Alto, CA). In light of the disclosures herein, one of ordinary skill in the art is 
now enabled to produce a variety of PS-interacting protein fusions, including fusions 
including either normal or mutant functional domains of the PS-interacting proteins, 
and to screen such fusion libraries in order to identify PS -interacting protein binding 
proteins. 

E. Other Methods 

The nucleotide sequences and protein products, including both mutant and 
normal forms of these nucleic acids and their corresponding proteins, can be used with 
the above techniques to isolate other interacting proteins, and to identify other genes 
whose expression is altered by the over-expression of normal PS-interacting protein 
sequences, by the under-expression of normal PS-interacting protein sequences, or by 
the expression of mutant PS-interacting protein sequences. Identification of these 
other interacting proteins, as well as the identification of other genes whose 
expression levels are altered in AD will identify other gene targets which have direct 
relevance to the pathogenesis of this disease in its clinical or pathological forms. 
Specifically, other genes will be identified which may themselves be the site of other 
mutations causing Alzheimer's Disease, or which can themselves be targeted 
therapeutically (e.g., to reduce their expression levels to normal, or to 
pharmacologically block the effects of their over-expression) as a potential treatment 
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for this disease. Specifically, these techniques rely on PCR-based and/or 
hybridization-based methods to identify genes which are differentially expressed 
between two conditions (a cell line expressing normal PS-interacting proteins 
compared to the same cell type expressing a mutant PS-interacting protein). These 
5 techniques include differential display, serial analysis of gene expression (SAGE), and 
mass-spectrometry of protein 2D-gels and subtractive hybridization (reviewed in 
Nowak, 1995 and Kahn, 1995). 

As will be obvious to one of ordinary skill in the art, there are numerous 
other methods of screening individual proteins or other compounds, as well as large 

10 libraries of proteins or other compounds (e.g., phage display libraries and cloning 

systems from Stratagene, La Jolla, CA) to identify molecules which bind to normal or 
mutant PS-interacting protein components. All of these methods comprise the step of 
mixing a normal or mutant PS-interacting protein, fusion, or fragment with test 
compounds, allowing for binding (if any), and assaying for bound complexes. All 

15 such methods are now enabled by the present disclosure of substantially pure PS- 
interacting proteins, substantially pure PS-interacting functional domain fragments, 
PS-interacting protein fusion proteins, PS-interacting protein antibodies, and methods 
of making and using the same. 

20 8. Disrupting PS-Interactine Protein Interactions 

The ability to disrupt specific interactions of the PS-interacting proteins 
with the presenilins, or with other proteins, is potentially of great therapeutic value, 
and will be important in understanding the etiology of AD and in identifying 
additional targets for therapy. The methods used to identify compounds which disrupt 

25 PS-interacting protein interactions may be applied equally well to interactions 
involving either normal or mutant PS-interacting proteins. 

Assays for compounds which can disrupt PS-interacting protein 
interactions may be performed by any of a variety of methods well known in the art. 
In essence, such assays will parallel those assays for identifying proteins and 

30 compounds with binding activity toward the PS-interacting proteins. Thus, once a 
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compound with binding activity for a PS-interacting protein is identified by any 
method, that method or an equivalent method may be performed in the presence of 
candidate compounds to identify compounds which disrupt the interaction. Thus, for 
example, the assay may employ methods including (1) affinity chromatography; (2) 
immunoprecipitation; (3) the Biomolecular Interaction Assay (BIAcore); or (4) the 
yeast two-hybrid systems. Such assays can be developed using either normal or 
mutant purified PS-interacting proteins, and/or either normal or mutant purified 
binding proteins (e.g., normal or mutant presenilins). 

For affinity methods, either the PS-interacting protein or its binding 
partner may be affixed to a matrix, for example in a column, and the counterpart 
protein (e.g., the PS-interacting protein if presenilin or another binding partner is 
affixed to the matrix; or a presenilin or other binding partner if the PS -interacting 
protein is affixed to the matrix) is then exposed to the affixed protein/compound either 
before or after adding the candidate compound(s). In the absence of a disruptive 
effect by the candidate compound(s), the interaction between the PS-interacting 
protein and its binding partner will cause the counterpart protein to bind to the affixed 
protein. Any compound which disrupts the interaction will cause release of the 
counterpart protein from the matrix. Release of the counterpart protein from the 
matrix can be measured using methods known in the art. 

For PS-interacting protein interactions which are detectable by yeast two- 
hybrid systems, these assays may also be employed to identify compounds which 
disrupt the interaction. Briefly, a PS-interacting protein and its binding partner (or 
appropriate structural domains of each) are employed in the fusion proteins of the 
system, and the cells are exposed to candidate compounds to determine their effect 
upon the expression of the reporter gene. By appropriate choice of a reporter gene, 
such a system can be readily adapted for high through-put screening of large libraries 
of compounds by, for example, using a reporter gene which confers resistance to an 
antibiotic which is present in the medium, or which rescues an auxotrophic strain 
grown in minimal medium. 
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These assays may be used to screen many different types of compounds for 
their disruptive effect on the interactions of the PS-interacting proteins. For example, 
the compounds may belong to a library of synthetic molecules, or be specifically 
designed to disrupt the interaction. The compounds may also be peptides 
5 corresponding to the interacting domain of either protein. This type of assay can be 
used to identify compounds that disrupt a specific interaction between a given PS- 
interacting protein variant and a given binding partner. In addition, compounds that 
disrupt all interactions with PS-interacting proteins may be identified. For example, a 
compound that specifically disrupts the folding of PS-interacting proteins would be 
10 expected to disrupt all interactions between PS-interacting proteins and other proteins. 
Alternatively, this type of disruption assay can be used to identify compounds which 
disrupt only a range of different PS-interacting protein interactions, or only a single 
PS-interacting protein interaction. 

15 9. Methods of Identifying Compounds Modulating PS-Interacting Protein Activity 
In another series of embodiments, the present invention provides for 
methods of identifying compounds with the ability to modulate the activity of normal 
and mutant PS-interacting proteins. As used with respect to this series of 
embodiments, the term "activity" broadly includes gene and protein expression, PS- 

20 interacting protein post-translation processing, trafficking and localization, and any 
functional activity (e.g., enzymatic, receptor-effector, binding, channel), as well as 
downstream affects of any of these. It is known that Alzheimer's Disease is associated 
with increased production of the long form of Ap peptides, the appearance of amyloid 
plaques and neurofibrillary tangles, decreases in cognitive function, and apoptotic cell 

25 death. Therefore, using the transformed cells and transgenic animal models of the 
present invention, cells obtained from subjects bearing normal or mutant PS- 
interacting protein genes, or animals or human subjects bearing naturally occurring 
normal or mutant PS-interacting proteins, it is now possible to screen candidate 
pharmaceuticals and treatments for their therapeutic effects by detecting changes in 
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one or more of these functional characteristics or phenotypic manifestations of normal 
or mutant PS -interacting protein expression. 

Thus, the present invention provides methods for screening or assaying for 
proteins, small molecules or other compounds which modulate PS-interacting protein 
5 activity by contacting a cell in vivo or in vitro with a candidate compound and 
assaying for a change in a marker associated with normal or mutant PS -interacting 
protein activity. The marker associated with PS-interacting protein activity may be 
any measurable biochemical, physiological, histological and/or behavioral 
characteristic associated with PS-interacting protein expression. In particular, useful 
10 markers will include any measurable biochemical, physiological, histological and/or 
behavioral characteristic which distinguishes cells, tissues, animals or individuals 
bearing at least one mutant presenilin or PS-interacting protein gene from their normal 
counterparts. In addition, the marker may be any specific or non-specific measure of 
presenilin or PS-interacting protein activity. PS-interacting protein specific measures 
15 include measures of PS-interacting protein expression (e.g., PS-interacting protein 
mRNA or protein levels) which may employ the nucleic acid probes or antibodies of 
the present invention. Non-specific measures include changes in cell physiology such 
as pH, intracellular calcium, cyclic AMP levels, GTP/GDP ratios, 
phosphatidylinositol activity, protein phosphorylation, etc., which can be monitored 
20 on devices such as the cytosensor microphysiometer (Molecular Devices Inc., United 
States). The activation or inhibition of PS-interacting protein activity in its mutant or 
normal form can also be monitored by examining changes in the expression of other 
genes (e.g., the presenilins) which are specific to the PS-interacting protein pathway 
leading to Alzheimer's Disease. These can be assayed by such techniques as 
25 differential display, differential hybridization, and SAGE (sequential analysis of gene 
expression), as well as by two dimensional gel electrophoresis of cellular lysates. In 
each case, the differentially-expressed genes can be ascertained by inspection of 
identical studies before and after application of the candidate compound. 
Furthermore, as noted elsewhere, the particular genes whose expression is modulated 
JO by the administration of the candidate compound can be ascertained by cloning, 
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nucleotide sequencing, amino acid sequencing, or mass spectrometry (reviewed in 
Nowak, 1995). 

In general, a cell may be contacted with a candidate compound and, after 
an appropriate period (e.g., 0-72 hours for most biochemical measures of cultured 
5 cells), the marker of presenilin or PS-interacting protein activity may be assayed and 
compared to a baseline measurement. The baseline measurement may be made prior 
to contacting the cell with the candidate compound or may be an external baseline 
established by other experiments or known in the art. The cell may be a transformed 
cell of the present invention or an explant from an animal or individual. In particular, 

10 the cell may be an explant from a carrier of a presenilin or PS-interacting protein 

mutation (e.g., a human subject with Alzheimer's Disease) or an animal model of the 
invention (e.g., a transgenic nematode or mouse bearing a mutant presenilin or PS- 
interacting protein gene). To augment the effect of presenilin or PS-interacting 
protein mutations on the AfJ pathway, transgenic cells or animals may be employed 

15 which have increased Ap production. Preferred cells include those from neurological 
tissues such as neuronal, glial or mixed cell cultures; and cultured fibroblasts, liver, 
kidney, spleen, or bone marrow. The cells may be contacted with the candidate 
compounds in a culture in vitro or may be administered in vivo to a live animal or 
human subject. For live animals or human subjects, the test compound may be 

20 administered orally or by any parenteral route suitable to the compound. For clinical 
trials of human subjects, measurements may be conducted periodically (e.g., daily, 
weekly or monthly) for several months or years. 

Because most individuals bearing a mutation in a particular gene are 
heterozygous at that locus (i.e., bearing one normal and one mutant allele), 

25 compounds may be tested for their ability to modulate normal as well as mutant 
presenilin or PS-interacting protein activity. Thus, for example, compounds which 
enhance the function of normal presenilins or PS-interacting proteins may have utility 
in treating Alzheimer's Disease or related disorders. Alternatively, because 
suppression of the activity of both normal and mutant copies of a gene in a 

30 heterozygous individual may have less severe clinical consequences than progression 
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of the associated disease, it may be desired to identify compound which inactivate or 
suppress all forms of the presenilins, the PS-interacting proteins, or their interactions. 
Preferably, however, compounds are identified which selectively or specifically 
inactivate or suppress the activity of mutant presenilin or PS-interacting proteins 
5 without disrupting the function of their normal counterparts. 

In light of the identification, characterization, and disclosure herein of a 
novel group of PS-interacting genes and proteins, the PS-interacting protein nucleic 
acid probes and antibodies, and the PS-interacting protein transformed cells and 
transgenic animals of the invention, one of ordinary skill in the art is now enabled by 
10 perform a great variety of assays which will detect the modulation of presenilin and/or 
PS-interacting protein activity by candidate compounds. Particularly preferred and 
contemplated embodiments are discussed in some detail below. 

A. PS-Interacting Protein Expression 

In one series of embodiments, specific measures of PS-interacting protein 
15 expression are employed to screen candidate compounds for their ability to affect 

presenilin activity. Thus, using the PS-interacting protein nucleic acids and antibodies 
disclosed and otherwise enabled herein, one may use mRNA levels or protein levels 
as a marker for the ability of a candidate compound to modulate PS-interacting 
protein activity. The use of such probes and antibodies to measure gene and protein 
20 expression is well known in the art and discussed elsewhere herein. Of particular 

interest may be the identification of compounds which can alter the relative levels of 
different variants (e.g., mutant and normal) of the PS-interacting proteins. 

B. Intracellular Localization 

In another series of embodiments, compounds may be screened for their 
25 ability to modulate the activity of the PS-interacting proteins based upon their effects 
on the trafficking and intracellular localization of the PS-interacting proteins. The 
presenilins and some of the PS-interacting proteins (e.g., S5a) have been seen 
immunocytochemically to be localized in membrane structures associated with the 
endoplasmic reticulum and Golgi apparatus. Differences in localization of mutant and 
JO normal presenilins or PS-interacting proteins may, therefore, contribute to the etiology 
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of Alzheimer's Disease and related disorders. Compounds which can affect the 
localization of these proteins may, therefore, be identified as potential therapeutics. 
Standard techniques known in the art may be employed to detect the localization of 
the presenilins and PS-interacting proteins. Generally, these techniques will employ 
5 the antibodies of the present invention, and in particular antibodies which selectively 
bind to one or more mutant PS-interacting proteins but not to normal proteins. As is 
well known in the art, such antibodies may be labeled by any of a variety of 
techniques (e.g., fluorescent or radioactive tags, labeled secondary antibodies, avidin- 
biotin, etc.) to aid in visualizing the intracellular location of these proteins. The PS- 

10 interacting proteins may be co-localized to particular structures, as in known in the 
art, using antibodies to markers of those structures (e.g., TGN38 for the Golgi, 
transferrin receptor for post-Golgi transport vesicles, LAMP2 for ly so somes). 
Western blots of purified fractions from cell lysates enriched for different intracellular 
membrane bound organelles (e.g., lysosomes, synaptosomes, Golgi) may also be 

15 employed. 

B. Ion Regulation/Metabolism 

In another series of embodiments, compounds may be screened for their 
ability to modulate the activity of the presenilins or PS-interacting proteins based 
upon measures in intracellular Ca 2+ , Na* or K* levels or metabolism. As noted above, 

20 the presenilins are membrane associated proteins which may serve as, or interact with, 
ion receptors or ion channels. Thus, compounds may be screened for their ability to 
modulate presenilin and PS-interacting protein-related metabolism of calcium or other 
ions either in vivo or in vitro by, for example, measurements of ion channel fluxes 
and/or transmembrane voltage and/or current fluxes, using patch clamps, voltage 

25 clamps or fluorescent dyes sensitive to intracellular ion levels or transmembrane 
voltage. Ion channel or receptor function can also be assayed by measurements of 
activation of second messengers such as cyclic AMP, cGMP tyrosine kinases, 
phosphates, increases in intracellular Ca 2+ levels, etc. Recombinantly made proteins 
may also be reconstructed in artificial membrane systems to study ion channel 

30 conductance and, therefore, the "cell" employed in such assays may comprise an 



SUBSTITUTE SHEET (RULE 26) 



WO 97/27296 



PCT/CA97/00051 



-63- 



artificial membrane or cell. Assays for changes in ion regulation or metabolism can 
be performed on cultured cells expressing endogenous normal or mutant presenilins 
and PS-interacting proteins. Such studies also can be performed on cells transfected 
with vectors capable of expressing one of the presenilins or PS-interacting proteins, or 
5 functional domains of one of the presenilins or PS-interacting proteins, in normal or 
mutant form. In addition, to enhance the signal measured in such assays, cells may be 
co-transfected with genes encoding ion channel proteins. For example, Xenopus 
oocytes or rat kidney (HEK293) cells may be co-transfected with sequences encoding 
rat brain Na* pi subunits, rabbit skeletal muscle Ca 2+ 01 subunits, or rat heart IC pi 
10 subunits. Changes in presenilin or PS-interacting protein-mediated ion channel 
activity can be measured by, for example, two-microelectrode voltage-clamp 
recordings in oocytes, by whole-cell patch-clamp recordings in HEK293 cells, or by 
equivalent means. 

C. Apoptosis or Cell Death 
15 111 another series of embodiments, compounds may be screened for their 

ability to modulate the activity of the presenilins or PS-interacting proteins based 
upon their effects on presenilin or PS -interacting protein-related apoptosis or cell 
death. Thus, for example, baseline rates of apoptosis or cell death may be established 
for cells in culture, or the baseline degree of neuronal loss at a particular age may be 
20 established post-mortem for animal models or human subjects, and the ability of a 
candidate compound to suppress or inhibit apoptosis or cell death may be measured. 
Cell death may be measured by standard microscopic techniques (e.g., light 
microscopy) or apoptosis may be measured more specifically by characteristic nuclear 
morphologies or DNA fragmentation patterns which create nucleosomal ladders (see, 
25 e.g., Gavrieli et al., 1992; Jacobson et al., 1993; Vito et al., 1996). TUNEL may also 
be employed to evaluate cell death in brain (see, e.g., Lassmann et al., 1995). In 
preferred embodiments, compounds are screened for their ability to suppress or inhibit 
neuronal loss in the transgenic animal models of the invention. Transgenic mice 
bearing, for example, a mutant human, mutant mouse, or humanized mutant presenilin 
30 or PS-interacting protein gene may be employed to identify or evaluate compounds 



SUBSTITUTE SHEET (RULE 26) 



WO 97/27296 



PCT/CA97A00051 



-64- 



which may delay or arrest the neurodegeneration associated with Alzheimer's 
Disease. A similar transgenic mouse model, bearing a mutant APP gene, has recently 
been reported by Games et al. (1995). 

D. Ap Peptide Production 
5 In another series of embodiments, compounds may be screened for their 

ability to modulate presenilin or PS-interacting protein-related changes in APP 
processing. The Ap peptide is produced in several isoforms resulting from differences 
in APP processing. The Ap peptide is a 39 to 43 amino acid derivative of pAPP 
which is progressively deposited in diffuse and senile plaques and in blood vessels of 

10 subjects with AD. In human brain, AP peptides are heterogeneous at both the N- and 
C-termini. Several observations, however, suggest that both the full length and N- 
terminal truncated forms of the long-tailed AP peptides ending at residue 42 or 43 
(i.e., Ap 1-42/43 and Apx-42/43) have a more important role in AD than do peptides 
ending at residue 40. Thus, Ap 1-42/43 and Apx-42/43 are an early and prominent 

15 feature of both senile plaques and diffuse plaques, while peptides ending at residue 40 
(i.e., Apl-40 and APx-40) are predominantly associated with a subset of mature 
plaques and with amyloidotic blood vessels (see, e.g., Iwatsubo et al., 1995; Gravina 
et al., 1995; Tamaoka et al., 1995; Podlisny et al. 1995). Furthermore, the long-tailed 
isoforms have a greater propensity to fibril formation, and are thought to be more 

20 neurotoxic than Ap 1 -40 peptides (Pike et al., 1 993 ; Hilbich et ah, 1 99 1 ). Finally, 
missense mutations at codon 717 of the PAPP gene are associated with early onset 
FAD, and result in overproduction of long-tailed AP in the brain of affected mutation 
earners, in peripheral cells and plasma of both affected and presymptomatic carriers, 
and in cell lines transfected with pAPP 717 mutant cDNAs (Tamaoka et al., 1994; 

25 Suzuki et al., 1994). 

Thus, in one series of embodiments, the present invention provides 
methods for screening candidate compounds for their ability to block or inhibit the 
increased production of long isoforms of the Ap peptides in cells or transgenic 
animals expressing a normal or mutant presenilin gene and/or a normal or mutant PS- 

30 interacting protein gene. In particular, the present invention provides such methods in 
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which cultured mammalian cells, such as brain cells or fibroblasts, have been 
transformed according to the methods disclosed herein, or in which transgenic 
animals, such as rodents or non-human primates, have been produced by the methods 
disclosed herein, to express relatively high levels of a normal or mutant presenilin or 
PS-interacting protein. Optionally, such cells or transgenic animals may also be 
transformed so as to express a normal or mutant form of the PAPP protein at 
relatively high levels. 

In this series of embodiments, the candidate compound is administered to 
the cell line or transgenic animals (e.g., by addition to the media of cells in culture; or 
by oral or parenteral administration to an animal) and, after an appropriate period 
(e.g., 0-72 hours for cells in culture, days or months for animal models), a biological 
sample is collected (e.g., cell culture supernatant or cell lysate from cells in culture; 
tissue homogenate or plasma from an animal) and tested for the level of the long 
isoforms of the Ap peptides. The levels of the peptides may be determined in an 
15 absolute sense (e.g., nMol/ml) or in a relative sense (e.g., ratio of long to short Ap 
isoforms). The AP isoforms may be detected by any means known in the art (e.g., 
electrophoretic separation and sequencing) but, preferably, antibodies which are 
specific to the long isoform are employed to determine the absolute or relative levels 
of the Ap 1-42/43 or Apx-42/43 peptides. Candidate pharmaceuticals or therapies 
20 which reduce the absolute or relative levels of these long Ap isoforms, particularly in 
the transgenic animal models of the invention, are likely to have therapeutic utility in 
the treatment of Alzheimer's Disease, or other disorders caused by mutations in the 
presenilins or PS-interacting proteins, or by other aberrations in APP metabolism. 
E- Phosphor ylation of Microtubule Associated Proteins 
25 In another series of embodiments, candidate compounds may be screened 

for their ability to modulate presenilin or PS-interacting protein activity by assessing 
the effect of the compound on levels of phosphorylation of microtubule associated 
proteins (MAPs) such as tau. The abnormal phosphorylation of tau and other MAPs 
in the brains of victims of Alzheimer's Disease is well known in the art. Thus, 
30 compounds which prevent or inhibit the abnormal phosphorylation of MAPs may 
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have utility in treating presenilin or PS-interacting protein-associated diseases such as 
AD. As above, cells from normal or mutant animals or subjects, or the transformed 
cell lines and animal models of the invention may be employed. Preferred assays will 
employ cell lines or animal models transformed with a mutant human or humanized 
5 mutant presenilin or PS-interacting protein gene. The baseline phosphorylation state 
of MAPs in these cells may be established and then candidate compounds may be 
tested for their ability to prevent, inhibit or counteract the hyperphosphorylation 
associated with mutants. The phosphorylation state of the MAPs may be determined 
by any standard method known in the art but, preferably, antibodies which bind 
10 selectively to phosphorylated or unphosphorylated epitopes are employed. Such 
antibodies to phosphorylation epitopes of the tau protein are known in the art (e.g., 
ALZ50). 

10. Screening and Diagnostics for Alzheimer's Disease 

15 A. General Diagnostic Methods 

The PS-interacting genes and gene products, as well as the PS-interacting 
protein derived probes, primers and antibodies, disclosed or otherwise enabled herein, 
are useful in the screening for carriers of alleles associated with Alzheimer's Disease, 
for diagnosis of victims of Alzheimer's Disease, and for the screening and diagnosis of 

20 related presenile and senile dementias, psychiatric diseases such as schizophrenia and 
depression, and neurologic diseases such as stroke and cerebral hemorrhage, all of 
which are seen to a greater or lesser extent in symptomatic human subjects bearing 
mutations in the PS1 or PS2 genes or in the APP gene. Individuals at risk for 
Alzheimer's Disease, such as those with AD present in the family pedigree, or 

25 individuals not previously known to be at risk, may be routinely screened using 

probes to detect the presence of a mutant PS-interacting protein gene or protein by a 
variety of techniques. Diagnosis of inherited cases of these diseases can be 
accomplished by methods based upon the nucleic acids (including genomic and 
mRNA/cDNA sequences), proteins, and/or antibodies disclosed and enabled herein, 

30 including functional assays designed to detect failure or augmentation of the normal 
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presenilin or PS-interacting protein activity and/or the presence of specific new 
activities conferred by mutant PS -interacting proteins. Preferably, the methods and 
products are based upon the human nucleic acids, proteins or antibodies, as disclosed 
or otherwise enabled herein. As will be obvious to one of ordinary skill in the art, 
however, the significant evolutionary conservation of large portions of nucleotide and 
amino acid sequences, even in species as diverse as humans, mice, C. eleeans. and 
Drosophila , allow the skilled artisan to make use of non-human homologues of the 
PS-interacting proteins to produce useful nucleic acids, proteins and antibodies, even 
for applications directed toward human or other animal subjects. Thus, for brevity of 
exposition, but without limiting the scope of the invention, the following description 
will focus upon uses of the human homologues of PS-interacting proteins and genes. 
It will be understood, however, that homologous sequences from other species will be 
equivalent for many purposes. 

As will be appreciated by one of ordinary skill in the art, the choice of 
15 diagnostic methods of the present invention will be influenced by the nature of the 
available biological samples to be tested and the nature of the information required. 
Alzheimer's Disease is, of course, primarily a disease of the brain, but brain biopsies 
are invasive and expensive procedures, particularly for routine screening. Other 
tissues which express the presenilins or PS-interacting proteins at significant levels 
20 may, therefore, be preferred as sources for samples. 

B. Protein B ased Screens and Diagnostics 

When a diagnostic assay is to be based upon PS-interacting proteins, a variety 
of approaches are possible. For example, diagnosis can be achieved by monitoring 
differences in the electrophoretic mobility of normal and mutant proteins. Such an 
approach will be particularly useful in identifying mutants in which charge 
substitutions are present, or in which insertions, deletions or substitutions have 
resulted in a significant change in the electrophoretic migration of the resultant 
protein. Alternatively, diagnosis may be based upon differences in the proteolytic 
cleavage patterns of normal and mutant proteins, differences in molar ratios of the 
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various amino acid residues, or by functional assays demonstrating altered function of 
the gene products. 

In preferred embodiments, protein-based diagnostics will employ differences 
in the ability of antibodies to bind to normal and mutant PS-interacting proteins. Such 
5 diagnostic tests may employ antibodies which bind to the normal proteins but not to 
mutant proteins, or vice versa. In particular, an assay in which a plurality of 
monoclonal antibodies, each capable of binding to a mutant epitope, may be 
employed. The levels of anti-mutant antibody binding in a sample obtained from a 
test subject (visualized by, for example, radiolabelling, ELISA or chemi luminescence) 

10 may be compared to the levels of binding to a control sample. Alternatively, 
antibodies which bind to normal but not mutant proteins may be employed, and 
decreases in the level of antibody binding may be used to distinguish homozygous 
normal individuals from mutant heterozygotes or homozygotes. Such antibody 
diagnostics may be used for in situ immunohistochemistry using biopsy samples of 

15 CNS tissues obtained antemortem or postmortem, including neuropathological 

structures associated with these diseases such as neurofibrillary tangles and amyloid 
plaques, or may be used with fluid samples such a cerebrospinal fluid or with 
peripheral tissues such as white blood cells. 

C. Nucleic Acid Based Screens and Diagnostics 

20 When the diagnostic assay is to be based upon nucleic acids from a sample, 

the assay may be based upon mRNA, cDNA or genomic DNA. When mRNA is used 
from a sample, there are considerations with respect to source tissues and the 
possibility of alternative splicing. That is, there may be little or no expression of 
transcripts unless appropriate tissue sources are chosen or available, and alternative 

25 splicing may result in the loss of some information or difficulty in interpretation. 

Whether mRNA, cDNA or genomic DNA is assayed, standard methods well known in 
the art may be used to detect the presence of a particular sequence either in situ or in 
vitro (see, e.g., Sambrook et aL, (1989) Molecular Cloning: A Laboratory Manual . 
2nd ed., Cold Spring Harbor Press, Cold Spring Harbor, NY). As a general matter, 

30 however, any tissue with nucleated cells may be examined. 
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Genomic DNA used for the diagnosis may be obtained from body cells, such 
as those present in the blood, tissue biopsy, surgical specimen, or autopsy material. 
The DNA may be isolated and used directly for detection of a specific sequence or 
may be amplified by the polymerase chain reaction (PCR) prior to analysis. 
5 Similarly, RNA or cDNA may also be used, with or without PCR amplification. To 
detect a specific nucleic acid sequence, direct nucleotide sequencing, hybridization 
using specific oligonucleotides, restriction enzyme digest and mapping, PCR 
mapping, RNase protection, chemical mismatch cleavage, ligase-mediated detection, 
and various other methods may be employed. Oligonucleotides specific to particular 
10 sequences can be chemically synthesized and labeled radioactively or non- 
radioactive^ (e.g., biotin tags, ethidium bromide), and hybridized to individual 
samples immobilized on membranes or other solid-supports (e.g., by dot-blot or 
transfer from gels after electrophoresis), or in solution. The presence or absence of 
the target sequences may then be visualized using methods such as autoradiography, 
15 fluorometry, or colorimetry. These procedures can be automated using redundant, 
short oligonucleotides of known sequence fixed in high density to silicon chips. 
(1) Appropriate Probes and Primers 
Whether for hybridization, RNase protection, ligase-mediated detection, PCR 
amplification or any other standards methods described herein and well known in the 
20 art, a variety of subsequences of the PS-interacting protein sequences disclosed or 

otherwise enabled herein will be useful as probes and/or primers. These sequences or 
subsequences will include both normal sequences and deleterious mutant sequences. 
In general, useful sequences will include at least 8-9, more preferably 10-50, and most 
preferably 18-24 consecutive nucleotides from introns, exons or intron/exon 
25 boundaries. Depending upon the target sequence, the specificity required, and future 
technological developments, shorter sequences may also have utility. Therefore, any 
PS-interacting protein derived sequence which is employed to isolate, clone, amplify, 
identify or otherwise manipulate a PS-interacting protein sequence may be regarded as 
an appropriate probe or primer. Particularly contemplated as useful will be sequences 
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including nucleotide positions from the PS-interacting protein genes in which disease- 
causing mutations are known to be present, or sequences which flank these positions. 
(2) Hybridization Screening 
For in situ detection of a normal or mutant PS-interacting protein-related 
5 nucleic acid sequence, a sample of tissue may be prepared by standard techniques and 
then contacted with one or more of the above-described probes, preferably one which 
is labeled to facilitate detection, and an assay for nucleic acid hybridization is 
conducted under stringent conditions which permit hybridization only between the 
probe and highly or perfectly complementary sequences. Because many mutations 

10 consist of a single nucleotide substitution, high stringency hybridization conditions 
may be required to distinguish normal sequences from most mutant sequences. When 
the PS-interacting protein genotypes of the subject's parents are known, probes may 
be chosen accordingly. Alternatively, probes to a variety of mutants may be 
employed sequentially or in combination. Because most individuals carrying 

15 mutations in the PS-interacting proteins will be heterozygous, probes to normal 
sequences also may be employed and homozygous normal individuals may be 
distinguished from mutant heterozygotes by the amount of binding (e.g., by intensity 
of radioactive signal). In another variation, competitive binding assays may be 
employed in which both normal and mutant probes are used but only one is labeled. 

20 (3) Restriction Mapping 

Sequence alterations may also create or destroy fortuitous restriction enzyme 
recognition sites which are revealed by the use of appropriate enzyme digestion 
followed by gel-blot hybridization. DNA fragments carrying the site (normal or 
mutant) are detected by their increase or reduction in size, or by the increase or 

25 decrease of corresponding restriction fragment numbers. Such restriction fragment 
length polymorphism analysis (RFLP), or restriction mapping, may be employed with 
genomic DNA, mRNA or cDNA. The PS-interacting protein sequences may be 
amplified by PCR using the above-described primers prior to restriction, in which 
case the lengths of the PCR products may indicate the presence or absence of 

30 particular restriction sites, and/or may be subjected to restriction after amplification. 
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The restriction fragments may be visualized by any convenient means (e.g., under UV 
light in the presence of ethidium bromide). 
(4) PCR Mapping 

In another series of embodiments, a single base substitution mutation may be 
5 detected based on differential PCR product length or production in PCR. Thus, 
primers which span mutant sites or which, preferably, have 3' termini at mutation 
sites, may be employed to amplify a sample of genomic DNA, mRNA or cDNA from 
a subject. A mismatch at a mutational site may be expected to alter the ability of the 
normal or mutant primers to promote the polymerase reaction and, thereby, result in 
10 product profiles which differ between normal subjects and heterozygous and/or 
homozygous mutants. The PCR products of the normal and mutant gene may be 
differentially separated and detected by standard techniques, such as polyacrylamide 
or agarose gel electrophoresis and visualization with labeled probes, ethidium 
bromide or the like. Because of possible non-specific priming or readthrough of 
15 mutation sites, as well as the fact that most carriers of mutant alleles will be 
heterozygous, the power of this technique may be low. 
(5) Electrophoretic Mobility 
Genetic testing based on DNA sequence differences also may be achieved by 
detection of alterations in electrophoretic mobility of DNA, mRNA or cDNA 
20 fragments in gels. Small sequence deletions and insertions, for example, can be 

visualized by high resolution gel electrophoresis of single or double stranded DNA, or 
as changes in the migration pattern of DNA heteroduplexes in non-denaturing gel 
electrophoresis. Mutations or polymorphisms in the PS-interacting protein genes may 
also be detected by methods which exploit mobility shifts due to single-stranded 
25 conformational polymorphisms (SSCP) associated with mRNA or single-stranded 
DNA secondary structures. 

(6) Chemical Cleavage of Mismatches 
Mutations in the PS-interacting protein genes may also be detected by 
employing the chemical cleavage of mismatch (CCM) method (see, e.g., Saleeba and 
Cotton, 1993, and references therein). In this technique, probes (up to - 1 kb) may be 
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mixed with a sample of genomic DNA, cDNA or mRNA obtained i5x>m a subject. 
The sample and probes are mixed and subjected to conditions which allow for 
heteroduplex formation (if any). Preferably, both the probe and sample nucleic acids 
are double-stranded, or the probe and sample may be PCR amplified together, to 
5 ensure creation of all possible mismatch heteroduplexes. Mismatched T residues are 
reactive to osmium tetroxide and mismatched C residues are reactive to 
hydroxylamine. Because each mismatched A will be accompanied by a mismatched 
T, and each mismatched G will be accompanied by a mismatched C, any nucleotide 
differences between the probe and sample (including small insertions or deletions) 

10 will lead to the formation of at least one reactive heteroduplex. After treatment with 
osmium tetroxide and/or hydroxylamine to modify any mismatch sites, the mixture is 
subjected to chemical cleavage at any modified mismatch sites by, for example, 
reaction with piperidine. The mixture may then be analyzed by standard techniques 
such as gel electrophoresis to detect cleavage products which would indicate 

15 mismatches between the probe and sample. 

(7) Other Methods 
Various other methods of detecting PS-interacting protein mutations, based 
upon the sequences disclosed and otherwise enabled herein, will be apparent to those 
of ordinary skill in the art. Any of these may be employed in accordance with the 

20 present invention. These include, but are not limited to, nuclease protection assays 
(SI or ligase-mediated), ligated PCR, denaturing gradient gel electrophoresis (DGGE; 
see, e.g., Fischer and Lerman, 1983), restriction endonuclease fingerprinting 
combined with SSCP (REF-SSCP; see, e.g., Liu and Sommer, 1995), and the like. 
D. Other Screens and Diagnostics 

25 In inherited cases, as the primary event, and in non-inherited cases as a 

secondary event due to the disease state, abnormal processing of the presenilins, PS- 
interacting proteins, APP, or proteins reacting with the presenilins, PS-interacting 
proteins, or APP may occur. This can be detected as abnormal phosphorylation, 
glycosylation, glycation amidation or proteolytic cleavage products in body tissues or 

30 fluids (e.g., CSF or blood). 
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Diagnosis also can be made by observation of alterations in transcription, 
translation, and post-translational modification and processing, as well as alterations 
in the intracellular and extracellular trafficking of gene products in the brain and 
peripheral cells. Such changes will include alterations in the amount of messenger 
5 RNA and/or protein, alteration in phosphorylation state, abnormal intracellular 
location/distribution, abnormal extracellular distribution, etc. Such assays will 
include: Northern Blots (e.g., with PS-interacting protein-specific and non-specific 
nucleotide probes), Western blots and enzyme-linked immunosorbent assays (ELISA) 
(e.g., with antibodies raised specifically to a PS-interacting protein or PS-interacting 

10 functional domain, including various post-translational modification states including 
glycosylated and phosphorylated isoforms). These assays can be performed on 
peripheral tissues (e.g., blood cells, plasma, cultured or other fibroblast tissues, etc.) 
as well as on biopsies of CNS tissues obtained antemortem or postmortem, and upon 
cerebrospinal fluid. Such assays might also include in situ hybridization and 

15 immunohistochemistry (to localize messenger RNA and protein to specific subcellular 
compartments and/or within neuropathological structures associated with these 
diseases such as neurofibrillary tangles and amyloid plaques). 
E. Screening and Diagnostic Kits 

In accordance with the present invention, diagnostic kits are also provided 
20 which will include the reagents necessary for the above-described diagnostic screens. 
For example, kits may be provided which include antibodies or sets of antibodies 
which are specific to one or more mutant epitopes. These antibodies may, in 
particular, be labeled by any of the standard means which facilitate visualization of 
binding. Alternatively, kits may be provided in which oligonucleotide probes or PCR 
25 primers, as described above, are present for the detection and/or amplification of 
normal or mutant presenilin and/or PS-interacting protein nucleotide sequences. 
Again, such probes may be labeled for easier detection of specific hybridization. As 
appropriate to the various diagnostic embodiments described above, the 
oligonucleotide probes or antibodies in such kits may be immobilized to substrates 
30 and appropriate controls may be provided. 



SUBSTITUTE SHEET (RULE 26) 



WO 97/27296 PCT/CA97/00051 



-74- 



11. Methods of Treatment 

The present invention now provides a basis for therapeutic intervention in 
diseases which are caused, or which may be caused, by mutations in the PS- 
5 interacting proteins. As noted above, mutations in the hPSl and hPS2 genes have 
been associated with the development of early onset forms of Alzheimer's Disease 
and, therefore, the present invention is particularly directed to the treatment of 
subjects diagnosed with, or at risk of developing, Alzheimer's Disease. 

Without being bound to any particular theory of the invention, the effect of 

10 the Alzheimer's Disease related mutations in the presenilins appears to be a gain of a 
novel function, or an acceleration of a normal function, which directly or indirectly 
causes aberrant processing of the Amyloid Precursor Protein (APP) into A(3 peptide, 
abnormal phosphorylation homeostasis, and/or abnormal apoptosis in the brain. Such 
a gain of function or acceleration of function model would be consistent with the adult 

15 onset of the symptoms and the dominant inheritance of Alzheimer's Disease. 

Nonetheless, the mechanism by which mutations in the presenilins may cause these 
effects remains unknown. 

The present invention, by identifying a set of PS-interacting proteins, 
provides new therapeutic targets for intervening in the etiology of presenilin-related 

20 AD. In addition, as mutations in the presenilins may cause AD, it is likely that 

mutations in the PS-interacting proteins may also cause AD. The fact that the PS- 
interacting protein S5a is alternately processed in the brains of victims of sporadic 
AD, as well as in the brains of victims of presenilin-linked AD, suggests that, at the 
very least, this PS-interacting protein is involved in the etiology of AD independent of 

25 mutations in the presenilins. It is likely that the other PS-interacting proteins also 
may be involved in non-presenilin-linked AD. 

Therapies to treat PS-interacting protein-associated diseases such as AD 
may be based upon (1) administration of normal PS-interacting proteins, (2) gene 
therapy with normal PS-interacting protein genes to compensate for or replace the 

30 mutant genes, (3) gene therapy based upon antisense sequences to mutant PS- 
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interacting protein genes or which "knock-out" the mutant genes, (4) gene therapy 
based upon sequences which encode a protein which blocks or corrects the deleterious 
effects of PS-interacting protein mutants, (5) immunotherapy based upon antibodies to 
normal and/or mutant PS-interacting proteins, or (6) small molecules (drugs) which 
5 alter PS -interacting protein expression, alter interactions between PS-interacting 

proteins and other proteins or ligands, or which otherwise block the aberrant function 
of mutant presenilin or PS-interacting proteins by altering the structure of the mutant 
proteins, by enhancing their metabolic clearance, or by inhibiting their function. 
A. Protein Therapy 

10 Treatment of Alzheimer's Disease, or other disorders resulting from PS- 

interacting protein mutations, may be performed by replacing the mutant protein with 
normal protein, by modulating the function of the mutant protein, or by providing an 
excess of normal protein to reduce the effect of any aberrant function of the mutant 
proteins. 

15 To accomplish this, it is necessary to obtain, as described and enabled 

herein, large amounts of substantially pure PS-interacting protein from cultured cell 
systems which can express the protein. Delivery of the protein to the affected brain 
areas or other tissues can then be accomplished using appropriate packaging or 
administration systems including, for example, liposome mediated protein delivery to 

20 the target cells. 

B. Gene Therapy 

In one series of embodiments, gene therapy may be employed in which 
normal copies of a PS-interacting protein gene are introduced into patients to code 
successfully for normal protein in one or more different affected cell types. The gene 
must be delivered to those cells in a form in which it can be taken up and code for 
sufficient protein to provide effective function. Thus, it is preferred that the 
recombinant gene be operably joined to a strong promoter so as to provide a high 
level of expression which will compensate for, or out-compete, the mutant proteins. 
As noted above, the recombinant construct may contain endogenous or exogenous 
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regulatory elements, inducible or repressible regulatory elements, or tissue-specific 
regulatory elements. 

In another series of embodiments, gene therapy may be employed to 
replace the mutant gene by homologous recombination with a recombinant construct. 
5 The recombinant construct may contain a normal copy of the targeted PS-interacting 
protein gene, in which case the defect is corrected in situ, or may contain a "knock- 
out" construct which introduces a stop codon, missense mutation, or deletion which 
abolished function of the mutant gene. It should be noted in this respect that such a 
construct may knock-out both the normal and mutant copies of the targeted gene in a 

10 heterozygous individual, but the total loss of gene function may be less deleterious to 
the individual than continued progression of the disease state. 

In another series of embodiments, antisense gene therapy may be 
employed. The antisense therapy is based on the fact that sequence-specific 
suppression of gene expression can be achieved by intracellular hybridization between 

15 mRNA or DNA and a complementary antisense species. The formation of a hybrid 
duplex may then interfere with the transcription of the gene and/or the processing, 
transport, translation and/or stability of the target mRNA. Antisense strategies may 
use a variety of approaches including the administration of antisense oligonucleotides 
or antisense oligonucleotide analogs (e.g., analogs with phosphorothioate backbones) 

20 or transfection with antisense RNA expression vectors. Again, such vectors may 
include exogenous or endogenous regulatory regions, inducible or repressible 
regulatory elements, or tissue-specific regulatory elements. 

In another series of embodiments, gene therapy may be used to introduce a 
recombinant construct encoding a protein or peptide which blocks or otherwise 

25 corrects the aberrant function caused by a mutant presenilin or PS-interacting protein 
gene. In one embodiment, the recombinant gene may encode a peptide which 
corresponds to a domain of a PS-interacting which has been found to abnormally 
interact with another cell protein or other cell ligand (e.g., a mutant presenilin). Thus, 
for example, if a mutant PS1 TM6-»7 domain is found to interact with a PS- 

30 interacting protein but the corresponding normal TM6->7 domain does not undergo 
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this interaction, gene therapy may be employed to provide an excess of the mutant 
TM6-*7 domain which may compete with the mutant presenilin protein and inhibit o 
block the aberrant interaction. Alternatively, the PS-interacting domain of a PS- 
interacting protein which interacts with a mutant, but not a normal, presenilin may be 
5 encoded and expressed by a recombinant construct in order to compete with, and 
thereby inhibit or block, the aberrant interaction. 

Retroviral vectors can be used for somatic cell gene therapy especially 
because of their high efficiency of infection and stable integration and expression. A 
full length PS-interacting protein gene, subsequences encoding functional domains of 

10 these proteins, or any of the other therapeutic peptides described above, can be cloned 
into a retroviral vector and expression may be driven from its endogenous promoter, 
from the retroviral long terminal repeat, or from a promoter specific for the target cell 
type of interest (e.g., neurons). Other viral vectors which can be used include adeno- 
associated virus, vaccinia virus, bovine papilloma virus, or a herpes virus such as 

15 Epstein-Barr virus. 

C. Immunotherapy 

Immunotherapy is also possible for Alzheimer's Disease. Antibodies may 
be raised to a normal or mutant PS -interacting protein (or a portion thereof) and are 
administered to the patient to bind or block an aberrant interaction (e.g., with a mutant 

20 presenilin) and prevent its deleterious effects. Simultaneously, expression of the 

normal protein product could be encouraged. Alternatively, antibodies may be raised 
to specific complexes between mutant or wild-type PS-interacting proteins and their 
interaction partners. 

A further approach is to stimulate endogenous antibody production to the 

15 desired antigen. Administration could be in the form of a one time immunogenic 

preparation or vaccine immunization. The PS-interacting protein or other antigen may 
be mixed with pharmaceutically acceptable carriers or excipients compatible with the 
protein. The immunogenic composition and vaccine may further contain auxiliary 
substances such as emulsifying agents or adjuvants to enhance effectiveness. 
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Immunogenic compositions and vaccines may be administered parenterally by 
injection subcutaneously or intramuscularly. 

D. Small Molecule Therapeutics 

As described and enabled herein, the present invention provides for a 
5 number of methods of identifying small molecules or other compounds which may be 
useful in the treatment of Alzheimer's Disease or other disorders caused by mutations 
in the presenilins or PS-interacting proteins. Thus, for example, the present invention 
provides for methods of identifying proteins which bind to normal or mutant PS- 
interacting proteins (aside from the presenilins). The invention also provides for 
0 . methods of identifying small molecules which can be used to disrupt aberrant 

interactions between mutant presenilins and/or PS-interacting proteins and such other 
binding proteins or other cell components. 

Examples 

5 Example 1 . Isolation of PS-interacting proteins bv two-hvbrid veast system. 

To identify proteins interacting with the presenilin proteins, a 
commercially available yeast two-hybrid kit ("Matchmaker System 2" from Clontech, 
Palo Alto, CA) was employed to screen a brain cDNA library for clones which 
interact with functional domains of the presenilins. In view of the likelihood that the 

0 TM6-*7 loop domains of the presenilins are important functional domains, partial 
cDNA sequences encoding either residues 266-409 of the normal PS1 protein or 
residues 272-390 of the normal PS2 protein were ligated in-frame into the EcoRI and 
BamHI sites of the pAS2-l fusion-protein expression vector (Clontech). The resultant 
fusion proteins contain the GAL4 DNA binding domain coupled in-frame either to the 

5 TM6-»7 loop of the PS1 protein or to the TM6->7 loop of the PS2 protein. These 
expression plasmids were co-transformed into S. cerevisiae strain Y190 together with 
a library of human brain cDNAs ligated into the pACT2 yeast fusion-protein 
expression vector (Clontech) bearing the GAM activation domain using modified 
lithium acetate protocols of the "Matchmaker System 2" yeast two-hybrid kit 

0 (Clontech, Palo Alto, CA). Yeast clones bearing human brain cDNAs which interact 
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with the TM6-+7 loop domain were selected for His- resistance by plating on SD 
minimal medium lacking histidine and for Pgal+ activation by color selection. The 
His+ pgal+ clones were then purged of the pAS2-l "bait" construct by culture in 
10^g/ml cyclohexamide and the unknown "trapped" inserts of the human brain 
cDNAs encoding PS-interacting proteins were isolated by PCR and sequenced. Of 6 
million initial transformants, 200 positive clones were obtained after His- selection, 
and 42 after Pgal+ color selection, carried out in accordance with the manufacturer's 
protocol for selection of positive colonies. Of these 42 clones there were several 
independent clones representing the same genes. 

To address the likelihood that mutations in the presenilins cause AD 
through the acquisition of a novel but toxic function (i.e., dominant gain of function 
mutation) which is mediated by a novel interaction between the mutant proteins and 
one or more other cellular proteins, the human brain cDNA library cloned into the 
pACT2 expression vector (Clontech) was re-screened using mutant TM6->7 loop 
domain sequences as described above and accoiding to manufacturer's protocols. In 
particular, mutant presenilin sequences corresponding to residues 260-409 of PS1 
TM6-*7 loop domains bearing mutations L286V, L392V and A290-319 were ligated 
in-frame into the GAL4 DNA-binding domain of the pAS2-l vector (Clontech) and 
used to screen the human brain cDNA:GAL4 activation domain library of pACT 
vectors (Clontech). Yeast were co-transformed, positive colonies were selected, and 
"trapped" sequences were recovered and sequenced as described above. In addition to 
some of the same sequences recovered with the normal TM6-*7 loop domains, 
several new sequences were obtained which reflect aberrant interactions of the mutant 
presenilins with normal cellular proteins. 

The recovered and sequenced clones corresponding to these PS-interacting 
proteins were compared to the public sequence databases using the BL:ASTN 
algorithm via the NCBI e-mail server. Descriptions of several of these clones follow: 

Antisecretory Factor/ Pr oteasome S5a Suhnnit Two overlapping clones 
(Y2H29 and Y2H31) were identified which correspond to a C-terminal fragment of a 
protein alternatively identified as Antisecretory Factor ("ASF") or the Multiubiquitin 
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chain binding S5a subunit of the 26S proteasome ("S5a") (Johansson et al. 1995; 
Ferrell et al., 1996). The complete nucleotide and amino acid sequences of the S5a 
subunit are available through the public databases under Accession number U5 1007 
and are reproduced here as SEQ ID NO: 1 and SEQ ID NO: 2. The nucleotide 
5 sequences of the Y2H29 and Y2H3 1 clones include nucleotides 35 1-1330 of SEQ ID 
NO: 1 and amino acid residues 70-377 of SEQ ID NO: 2. Thus, residues 70-377 of 
the full S5a subunit include the PS-interacting domain of this protein. Residues 206- 
377 of S5a contain certain motifs that are important for protein-protein interactions 
(Ferrell et aL, 1996). 

10 The PS 1 -S5a subunit interaction was directly re-tested for both wild type 

and mutant PS1 TM6-*7 loop (residues 260-409) by transforming Y187 yeast cells 
with the appropriate wild type or mutant (L286V, L392V or A290-3 19) cDNA ligated 
in-frame to the GAL4-DNA binding domain of pACT2. The A290-319 mutant fusion 
construct displayed autonomous Pgal activation in the absence of any S5a "target 

15 sequence" and, therefore, could not be further analyzed. In contrast, both the L286V 
and L392V mutant constructs interacted specifically with the S5a construct. 
Quantitative assays, however, showed that these interactions were weaker than those 
involving the wild type PS1 26(WW sequence and that the degree of interaction was 
crudely correlated with the age of onset of FAD. The difference in Pgal activation 

20 was not attributable to instability of the mutant PSl 26(M09 construct mRNAs or fusion 
proteins because Western blots of lysates of transformed yeast showed equivalent 
quantities of mutant or wild-type fusion proteins. 

Because one of the putative functions of S5a is to bind multi-ubiquitinated 
proteins, the PSl:S5a interaction observed in S. cerevisiae could arise either through 

25 yeast-dependent ubiquitination of the PSl 260-40g construct, or by direct interaction. The 
former would reflect a degradative pathway, a functional and perhaps reciprocal 
interaction between PS1 and S5a, or both. A direct interaction is favored by the fact 
that the PSl:S5a interaction is decreased rather than increased by the presence of the 
L286V and L392V mutations, and by the fact that neither of these mutations affect 

30 ubiquitin conjugation sites in the PS1 2WM09 loop (i.e., K265, K311, K314 or K395). 
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To further examine this possibility, we investigated the direct interaction of 
recombinant His-tagged fusion proteins corresponding to full length S5a and the 
PS1 2(KM09 loop. Partially purified recombinant His-tagged PS1 26(M0 , loop and His- 
tagged S5a proteins and appropriate controls were mixed in phosphate buffered saline. 
5 The mixture was then subjected to size exclusion chromatography, and eluates were 
examined by SDS-PAGE and Western blotting using anti-His-tag monoclonal 
antibodies (Quiagen). In the crude PS1 26(M09 loop preparation alone, the PS1 26(M09 loop 
eluted from the size exclusion column as a broad peak at 35 minutes. In the crude S5a 
preparation alone, S5a eluted at 25 minutes. However, when the crude PS1 2(MM09 loop 
10 and S5a preparations were mixed, there was a significant shift in the elution of PS1 260 . 
40, toward a higher molecular weight complex. Co-elution of S5a and PS 1 2<XM09 in the 
same fraction was confirmed by SDS-PAGE and Western blotting of fractions using 
the anti-His-tag antibody. These results are consistent with a ubiquitin-independent 
and, therefore, possibly functional interaction. 
15 GT24 and related genes with homolo gy to nl20/nlakoglobin family Five 

over-lapping clones (Y2H6, Y2H10b, Y2H17h2, Y2H24, and Y2H25) were obtained 
which interact with the normal PS I TM6-+7 loop domain and which appear to 
represent at least one novel gene. The Y2H24 clone was also found to interact with 
the mutant PS1 TM6->7 loop domains. Note that it appears that more than one 
20 member of the gene family was isolated, suggesting a family of genes interacting 
differentially with different presenilins. The most complete available cDNA 
corresponding to these clones was designated GT24 and is disclosed herein as SEQ ID 
NO: 3 and has been deposited with GenBank as Accession number U81004. The 
open reading frame suggests that GT24 is a protein of at least 1040 amino acids with a 
25 unique N-terminus, and considerable homology to several armadillo farm) repeat 
proteins at its C-terminus. The predicted amino acid sequence of GT24 is disclosed 
herein as SEQ ID NO: 4. Thus, for example, residues 440-862 of GT24 have 32-56% 
identity (p=1.2e" 3 ) to residues 440-854 of murine pl20 protein (Accession number 
Z17804), and residues 367-815 of GT24 have 26-42% identity (p=0.0017) to residues 
30 245-465 of the D. melanogaster armadillo segment polarity protein (Accession 
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number P18824). The GT24 gene maps to chromosome 5pl5 near the anonymous 
microsatellite marker D5S748 and the Cri-du-Chat syndrome locus. This sequence is 
also nearly identical to portions of two human ESTs of unknown function (i.e., 
nucleotides 2701-3018 of Accession number F08730 and nucleotides 2974-3348 of 
5 Accession number T18858). These clones also show lower degrees of homology with 
other partial cDNA and gDNA sequences (e.g., H17245, T06654, T77214, H24294, 
M62015, T87427 and G04019). 

p0071 gene . An additional His", PgaP clone isolated in the initial screening 
with wild type PSl 26 6-409 "bait" had a similar nucleotide sequence to GT24 (target 

10 clone Y2H25; Accession number U81005), and would also be predicted to encode a 
peptide with C-terminal arm repeats. A longer cDNA sequence closely corresponding 
to the Y2H25 clone has been deposited in GenBank as human protein p0071 
(Accession number X81889). The nucleotide and corresponding amino acid 
sequences of p0071 are reproduced herein as SEQ ID NOs: 5 and 6. Comparison of 

15 the predicted sequence of the p0071 ORF with that of GT24 confirms that they are 

related proteins with 47% overall amino acid sequence identity, and with 70% identity 
between residues 346-862 of GT24, and residues 509-1022 of p0071 (which includes 
residues encoded by the Y2H25 cDNA). The latter result strongly suggests that PS1 
interacts with a novel class of arm repeat containing proteins. The broad - 4 kb 

20 hybridization signal obtained on Northern blots with the unique 5* end of GT24 could 
reflect either alternate splicing/polyadenylation of GT24, or the existence of 
additional members of this family with higher degrees of N-terminal homology to 
GT24 than p0071. 

Rabll gene. This clone (Y2H9), disclosed herein as SEQ ID NO: 7, was 

25 identified as interacting with the normal PS1 TM6-»7 loop domain and appears to 
correspond to a known gene, Rabl 1, available through Accession numbers X56740 
and X53143. Rabl 1 is believed to be involved in protein/vesicle trafficking in the 
ER/Golgi. Note the possible relationship to processing of membrane proteins such as 
BAPP and Notch with resultant overproduction of toxic AB peptides (especially 

30 neurotoxic Afl,^ 2( 4 3> isoforms) (Scheuner, et al, 1995). 
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Retinoid X receptor-B gene This clone (Y2H23b), disclosed herein as 
SEQ ID NO: 8, was identified as interacting with the normal PS1 TM6-»7 loop 
domain and appears to correspond to a known gene, known variously as the retinoid X 
receptor-p, nuclear receptor co-regulator or MHC Class I regulatory element, and 
5 available through Accession numbers M84820, X63522 and M8 1 766. This gene is 
believed to be involved in intercellular signaling, suggesting a possible relationship to 
the intercellular signaling function mediated by C. gjegang sell 2 and Notch/lin-12 
(transcription activator). 

Cytoplasmic chaperonin gene This clone (Y2H27), disclosed herein as 
10 SEQ ED NO: 9, was identified as interacting with the normal PS 1 TM6-*7 loop 
domain and appears to correspond to a known gene, a cytoplasmic chaperonin 
containing TCP-1, available through Accession numbers U17104 and X74801. 

Unknown gene fY2H3S). This clone (Y2H35), disclosed herein as SEQ 
ID NO: 10, was identified as interacting with the normal PS1 TM6-»-7 loop domain 
15 and appears to correspond to a known gene of unknown function, available through 
Accession number R12984, which shows conservation down through yeast. 

Unknown gene (Y2H171 ). This clone (Y2H1 71 ), disclosed herein as SEQ 
ID NO: 1 1, was identified as interacting with the normal PS1 TM6-+7 loop domain 
and appears to correspond to a known expressed repeat sequence available through 
20 Accession number D55326. 

Unknown gene (Y2H41). This clone (Y2H41 ) was identified which reacts 
strongly with the TM6-+7 loop domains of both PS1 and PS2 as well as the mutant 
loop domains of PS1. The sequence, disclosed as SEQ ID NO: 12, shows strong 
homology to an EST of unknown function (Accession number T64843). 
25 Example 2. Isolation of presenilin hindinp proteins hv affinity chromatograp hy 
To identify the proteins which may be involved in the biochemical 
function of the presenilins, PS-interacting proteins were isolated using affinity 
chromatography. A GST-fusion protein containing the PS1 TM6-+7 loop, prepared 
as described in Example 3, was used to probe human brain extracts, prepared by 
30 homogenizing brain tissue by Polytron in physiological salt solution. Non-specific 
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binding was eliminated by pre-clearing the brain homogenates of endogenous GST- 
binding components by incubation with glutathione-Sepharose beads. These GST- 
free homogenates were then incubated with the GST-PS fusion proteins to produce the 
desired complexes with functional binding proteins. These complexes were then 
5 recovered using the affinity glutathione-Sepharose beads. After extensive washing 
with phosphate buffered saline, the isolated collection of proteins was separated by 
SDS-polyacrylamide gel electrophoresis (SDS-PAGE; Tris-tricine gradient gel 4- 
20%). Two major bands were observed at -14 and 20 kD in addition to several 
weaker bands ranging from 50 to 60 kD. 
10 The same approach may now be used to identify proteins which have 

binding activity for the PS-interacting proteins and, thereby, to further elucidate the 
etiology of AD and to identify additional therapeutics targets for intervention in AD 
and related disorders. 

Example 3. Eukarvotic and prokarvotic expression vector systems. 

15 Constructs suitable for use in eukaryotic and prokaryotic expression 

systems have been generated using different classes of PS1 nucleotide cDNA 
sequence inserts. In the first class, termed full-length constructs, the entire PS1 cDNA 
sequence is inserted into the expression plasmid in the correct orientation, and 
includes both the natural 5' UTR and 3 1 UTR sequences as well as the entire open 

20 reading frame. The open reading frames bear a nucleotide sequence cassette which 

allows either the wild type open reading frame to be included in the expression system 
or alternatively, single or a combination of double mutations can be inserted into the 
open reading frame. This was accomplished by removing a restriction fragment from 
the wild type open reading frame using the enzymes Narl and Pflml and replacing it 

25 with a similar fragment generated by reverse transcriptase PCR and bearing the 

nucleotide sequence encoding either the M146L mutation or the H163R mutation. A 
second restriction fragment was removed from the wild type normal nucleotide 
sequence for the open reading frame by cleavage with the enzymes Pflml and Ncol 
and replaced with a restriction fragment bearing the nucleotide sequence encoding the 

30 A246E mutation, the A260V mutation, the A285 V mutation, the L286V mutation, the 
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L392V mutation or the C410Y mutation. A third variant, bearing a combination of 
either the M146L or H163R mutation in tandem with one of the remaining mutations, 
was made by linking a Narl-Pflml fragment bearing one of the former mutations and a 
Pflml-Ncol fragment bearing one of the latter mutations. 
5 The second class of cDNA inserts, termed truncated constructs, was 

constructed by removing the 5' UTR and part of the 3' UTR sequences from full 
length wild type or mutant cDNA sequences. The 5' UTR sequence was replaced with 
a synthetic oligonucleotide containing a Kpnl restriction site (GGTAC/C) and a small 
sequence (GCCACC) to create a Kozak initiation site around the ATG at the 
10 beginning of the PS 1 ORF. The 3' UTR was replaced with an oligonucleotide with an 
artificial EcoRI site at the 5' end. Mutant variants of this construct were then made by 
inserting the mutant sequences described above at the Narl-Pflml and Pslml-Ncol 
sites as described above. 

For eukaryotic expression, these various cDNA constructs bearing wild 
15 type and mutant sequences, as described above, were cloned into the expression 

vector pZeoSV in which the SV60 promoter cassette had been removed by restriction 
digestion and replaced with the CMV promoter element of pcDNA3 (Invitrogen). For 
prokaryotic expression, constructs have been made using the glutathione S-transferase 
(GST) fusion vector pGEX-kg. The inserts which have been attached to the GST 
20 fusion nucleotide sequence are the same nucleotide sequences described above 
bearing either the normal open reading frame nucleotide sequence, or bearing a 
combination of single and double mutations as described above. These GST fusion 
constructs allow expression of the partial or full-length protein in prokaryotic cell 
systems as mutant or wild type GST fusion proteins, thus allowing purification of the 
full-length protein followed by removal of the GST fusion product by thrombin 
digestion. A further cDNA construct was made with the GST fusion vector, to allow 
the production of the amino acid sequence corresponding to the hydrophilic acidic 
loop domain between TM6 and TM7 of the full-length protein, either as a wild type 
nucleotide sequence or as a mutant sequence bearing either the A285V mutation, the 
L286V mutation or the L392V mutation. This was accomplished by recovering wild 



25 



30 
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type or mutant sequence from appropriate sources of RNA using a 5' oligonucleotide 
primer with a 5* BamHI restriction site (G/GATCC), and a 3 1 primer with a 5' EcoRI 
restriction site (G/AATTC). This allowed cloning of the appropriate mutant or wild 
type nucleotide sequence corresponding to the hydrophilic acidic loop domain at the 
5 BamHI and the EcoRI sites within the pGEX-KG vector. 

The PS-interacting protein genes may be similarly manipulated by 
recombinant means for expression in prokaryotic or eukaryotic hosts. In particular, 
GST or other fusion proteins may be produced which will be useful in assays (e.g., 
yeast two-hybrid studies) for therapeutics. 
10 Example 4. Antibody production. 

Peptide antigens corresponding to portions of the PS1 protein were 
synthesized by solid-phase techniques and purified by reverse phase high pressure 
liquid chromatography. Peptides were covalently linked to keyhole limpet 
hemocyanin (KLH) via disulfide linkages that were made possible by the addition of a 
15 cysteine residue at the peptide C-terminus of the presenilin fragment. This additional 
residue does not appear normally in the protein sequence and was included only to 
facilitate linkage to the KLH molecule. 

A total of three New Zealand white rabbits were immunized with peptide- 
KLH complexes for each peptide antigen in combination with Freund's adjuvant and 
20 were subsequently given booster injections at seven day intervals. Antisera were 
collected for each peptide and pooled and IgG precipitated with ammonium sulfate. 
Antibodies were then affinity purified with Suifo-link agarose (Pierce) coupled with 
the appropriate peptide. This final purification is required to remove non-specific 
interactions of other antibodies present in either the pre- or post-immune serum. 
25 The specificity of each antibody was confirmed by three tests. First, each 

detected single predominant bands of the approximate size predicted for presenilin- 1 
on Western blots of brain homogenate. Second, each cross-reacted with recombinant 
fusion proteins bearing the appropriate sequence. Third each could be specifically 
blocked by pre-absorption with recombinant PS1 or the immunizing peptide. 
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Antibodies to peptides derived from the PS -interacting proteins may be 
produced by similar means. 
Example 5. Transgenic mice. 

A series of wild type and mutant PS 1 and PS2 genes were constructed for 
5 use in the preparation of transgenic mice. Mutant versions of PS 1 and PS2 were 
generated by site-directed mutagenesis of the cloned cDNAs using standard 
techniques. 

The cDNAs and their mutant versions were used to prepare two classes of 
mutant and wild type PS1 and PS2 cDNAs, as described in Example 3. The first 

10 class, referred to as "full-length" cDNAs, were prepared by removing approximately 
200 bp of the 3' untranslated region immediately before the polyA site by digestion 
with EcoRI (PS1) or PvuII (PS2). The second class, referred to as "truncated" 
cDNAs, were prepared by replacing the 5' untranslated region with a ribosome 
binding site (Kozak consensus sequence) placed immediately 5' of the ATG start 

15 codon. 

Various full length and truncated wild type and mutant PS1 and PS2 
cDNAs, prepared as described above, were introduced into one or more of the 
following vectors and the resulting constructs were used as a source of gene for the 
production of transgenic mice. 

20 The cos.TET express ion vector : This vector was derived from a cosmid 

clone containing the Syrian hamster PrP gene. It has been described in detail by Scott 
etaL (1992) and Hsiao et al. (1995). PS1 and PS2 cDNAs (full length or truncated) 
were inserted into this vector at its Sail site. The final constructs contain 20 kb of 5' 
sequence flanking the inserted cDNA. This 5' flanking sequence includes the PrP 

25 gene promoter, 50 bp of a PrP gene 5 1 untranslated region exon, a splice donor site, a 1 
kb intron, and a splice acceptor site located immediately adjacent to the Sail site into 
which the PS1 or PS2 cDNA was inserted. The 3' sequence flanking the inserted 
cDNA includes an approximately 8 kb segment of PrP 3' untranslated region 
including a polyadenylation signal. Digestion of this construct with NotI (PS1) or 

30 Fsel (PS2) released a fragment containing a mutant or wild type PS gene under the 
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control of the PrP promoter. The released fragment was gel purified and injected into 
the pronuclei of fertilized mouse eggs using the method of Hsiao et al. (1995). 

Platelet-derived growth factor receptor B-subunit constructs : PS cDNAs 
were also introduced between the Sail (fiill length PS1 cDNAs) or Hindin (truncated 
5 PS1 cDNAs, full length PS2 cDNAs, and truncated PS2 cDNAs) at the 3' end of the 
human platelet derived growth factor receptor (3-subunit promoter and the EcoRI site 
at the 5' end of the SV40 polyA sequence and the entire cassette was cloned into the 
pZeoSV vector (Invitrogen, San Diego, CA.). Fragments released by Scal/BamHI 
digestion were gel purified and injected into the pronuclei of fertilized mouse eggs 

10 using the method of Hsiao et al. ( 1 995). 

Human p-actin constructs : PS1 and PS2 cDNAs were inserted into the Sail 
site of pBAcGH. The construct produced by this insertion includes 3.4 kb of the 
human p actin 5* flanking sequence (the human P actin promoter, a spliced 78 bp 
human p actin 5* untranslated exon and intron) and the PS1 or PS2 insert followed by 

15 2.2 kb of human growth hormone genomic sequence containing several introns and 
exons as well as a polyadenylation signal. Sfil was used to release a PS-containing 
fragment which was gel purified and injected into the pronuclei of fertilized mouse 
eggs using the method of Hsiao et al. (1995). 

Phosphoglycerate kinase constructs : PS1 and PS2 cDNAs were introduced 

20 into the pkJ90 vector. The cDNAs were inserted between the Kpnl site downstream 
of the human phosphoglycerate kinase promoter and the Xbal site upstream of the 3* 
untranslated region of the human phosphoglycerate kinase gene. PvuII/Hindlll (PS1 
cDNAs) or PvuII (PS2 cDNAs) digestion was used to release a PS-containing 
fragment which was then gel purified and injected into the pronuclei of fertilized 

25 mouse eggs as described above. 

Analysis of AB in transgenic murine hippocampus : To analyze the effect 
of a mutant human PS1 transgene in mice, a PS1 mutation observed in conjunction 
with a particularly severe form of early-onset PS 1 -linked Alzheimer's disease was 
used, namely the M146L missense mutation (Sherrington et al., 1995). The animals, 

30 which were heterozygous for the PS1 mutant transgene on a mixed FVB-C57BL/6 
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strain background, were cross-bred with similar mice bearing the human wild-type 
PAPP W5 cDNA under the same Syrian hamster PrP promoter similar to those animals 
recently described by Hsiao et al., 1995. These cross breedings were done because it 
is thought that human Ap is more susceptible to the formation of aggregates than are 
5 murine AP peptides. 

The progeny of these PS1 M146L x pAPP w cross-breedings were then 
genotyped to identify animals that contained both the human wild-type PAPP WS 
transgene and also the mutant human PS1 MU6L transgene. These mice were aged until 
two to three months of age and then sacrificed, with the hippocampus and neocortex 
10 being dissected rapidly from the brain and frozen. Litter mates of these mice, which 
contained only the wild-type human pAPP 495 transgene were also sacrificed, and their 
hippocampi and neocortices were dissected and rapidly frozen as well. 

The concentration of both total Ap peptides (Ap XJ|0 and Ap XJ|2(43) ) as well 
as the subset of Ap peptides ending on residues 42 or 43 (long-tailed Ap 4J peptides) 
15 were then measured using a two-sandwich ELISA as described previously (Tamaoka 
et al., 1994; Suzuki et al., 1994). These results convincingly showed a small increase 
in total AP peptides in the double transgenic animals bearing wild-type human 
PAPP 695 and mutant human PS1 MU6L transgenes compared to the wild-type human 
PAPP WJ controls. More impressively, these measurements also showed that there was 
20 an increase in the amount of long-tailed Ap peptides ending on residues 42 or 43 

(Ap 42 ). In contrast, litter mates bearing only the wild-type human pAPP WJ transgene 
had Ap 42 long-tailed peptide values which were below the limit of quantitation 
("BLQ"). 

These observations therefore confirm that the construction of transgenic 
25 animals can recapitulate some of the biochemical features of human Alzheimer's 
disease (namely the overproduction of Ap peptide and, in particular, overproduction 
of long-tailed isoforms of Ap peptide). These observations thus prove that the 
transgenic models are in fact useful in exploring therapeutic targets relevant to the 
treatment and prevention of Alzheimer's disease. 
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Analvsis of hippocampus dependent memory functions in PS1 transgenic 
mice : Fourteen transgenic C57BL/6 x FVB mice bearing the human PS1 MM6V mutant 
transgene under the PrP promoter (as described) above and 12 wild type litter mates 
aged 2.5-3 months of age (both groups were balanced for age, weight, and sex) were 
5 investigated for behavioral differences attributable to the mutant transgene. Also the 
qualitative observation of murine behavior in their home cages did not indicate 
bimodal distribution of behaviors in the sample of animals. 

Experiment 1 . To test for subtle differences in exploratory 
behavior (e*g. locomotion, scanning of the environment through rearing, and patterns 

10 of investigation of unfamiliar environment), both PS1 MI46V and wild type litter mates 
were tested in the open-field (Janus, et al. 1995). The results of the test revealed no 
significant differences between transgenics and controls in exploration of a new 
environment measured by mice locomotor behaviors (walking, pausing, wall leaning, 
rearing, grooming), (F(l,24) = .98, NS). Thus, differences any in behavior on the 

15 Morris water maze test (see below) cannot be attributed to differences in locomotor 
abilities, etc. 

Experiment 2 . One week after the open-field test, the PS1 MI46V 
mutant transgenic mice and their litter mates were trained in the Morris water maze. 
In this test, a mouse has to swim in a pool in order to find a submerged escape 

20 platform. The animal solves that test through learning the location of the platform 
using the available extra-maze spatial cues (Morris, 1990). This test was chosen 
because there is strong evidence that the hippocampal formation is involved in this 
form of learning. The hippocampus is also a major site of AD neuropathology in 
humans and defects in spatial learning (geographic disorientation, losing objects, 

25 wandering, etc.) are prominent early features of human AD. As a result the test is 

likely to detect early changes equivalent to those seen in human AD. The Morris test 
is conducted in three phases. In the first phase (the learning acquisition phase), the 
mouse has to learn the spatial position of the platform. In the second phase (the probe 
trial), the platform is removed from the pool and the mouse's search for the platform 

30 is recorded. In the final phase (the learning transfer phase), the platform is replaced in 
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a new position in the pool, and the mouse has to leani that new spatial position of the 
platfonn. 

Transgenic and wild type mice did not differ in their latencies to find the 
platform during learning acquisition (F(l,24) = 0.81, NS), and both groups showed 
rapid learning across trials (F(10,15) = 1 1.57, p < 0.001). During the probe trial 
phase, mice from both groups searched the quadrant of the pool which originally 
contained the platform significantly longer than other areas of the pool which had not 
contained the platform (F(3,22) = 28.9, p < 0.001). However, the wild type controls 
showed a trend which was not quite statistically significant (t(24) - 1.21, p = 0.24) for 
an increased number of crossings of the exact previous position of the platform. In 
the learning transfer test, both groups showed the same latency of finding the new 
position of the platform in the initial block of trials (t(24) =1.11, NS). Such long 
latency to find the new spatial position is expected because the mice spent most of 
their time searching for the platform in the old spatial position. However, in later 
trials in the learning transfer phase, the wild type mice showed shorter swim latencies 
to the new position of the platform compared to the PS1 MM6V mutant transgenics 
(F(U4) 2.36, p = 0.14). The results indicate that PS 1 MM4V mutant transgenic mice 
were less flexible in transferring learned information to a new situation and tended to 
persevere in their search for the platform in the old location. 

Thus, although no differences were found in the spontaneous exploration 
of a new environment and in the acquisition of new spatial information between the 
wild type and the PS1 M146V mutant transgenic mice, the PS1 M146V mutant transgenic 
mice were impaired in switching and/or adapting this knowledge in later situations. 

Electrophysiological Recorders in the hiDDocanmi.s nf mutant tranvmir 
mice: Five to six months old litter mate control and human PS1 MU6V mutant 
transgenic mice on the same C57BL/6 x FVB strain backgrounds as above were used 
to study long term potentiation (LTP) as an electrophysiologic correlate of learning 
and memory in the hippocampus. Recordings were carried out on 400 urn thick 
hippocampal slices according to conventional techniques. Briefly, brains were 
removed and transverse sections containing hippocampi were obtained within 1 min. 
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after mice were decapitated under halothane anesthesia. Slices were kept at room 
temperature in oxygenated artificial cerebrospinal fluid for one hour prior to 
recording. One slice at a time was transferred to the recording chamber, where they 
were maintained at 32 °C in an interface between oxygenated artificial cerebrospinal 
5 fluid and humidified air. Slices were then allowed to equilibrate in the recording 
chamber for another hour. 

Extracellular field recordings were carried out in the CA1 subfield of the 
hippocampus at the Schaeffer collateral-pyramidal cell synapse. Synaptic responses 
were induced by the stimulation of Schaeffer collaterals at a frequency of 0.03 Hz and 

10 an intensity of 30-50 % of maximal response. Tetani to evoke long-term potentiation 
consisted of 5 trains of 100 Hz stimulation lasting for 200 ms at an intertrain interval 
of 10 seconds. Field potentials were recorded using an Axopatch 200B amplifier 
(Axon Instrument). Glass pipettes were fabricated from boro silicate glass with an 
outer diameter of 1 .5 mm, and pulled with a two step Narishige puller. Data were 

15 acquired on a 486-B3M compatible computer using PCLAMP6 software (Axon 
Instrument). 

To test for any abnormality in presynaptic function, we investigated the 
differences in paired-pulse facilitation, which is an example of use-dependent increase 
in synaptic efficacy and is considered to be presynaptic in origin. In hippocampus, 

20 when two stimuli are delivered to the Schaeffer collaterals in rapid succession, paired- 
pulse facilitation manifests itself as an enhanced dendritic response to the second 
stimulus as the interstimulus interval gets shorter. In three pairs of wild- 
type/transgenic mice, we did not observe any difference in the paired-pulse facilitation 
over an interstimulus interval range of 20 ms to 1 sec. These data suggest that in 

25 PS1 MI46V mutant transgenic mice, the excitability of Schaeffer collateral fibers and 
neurotransmitter release are likely to be normal. 

Tetanic stimulation induced a long-lasting increase in the synaptic strength 
in both control (n = 3) and PS 1 M146V mutant transgenic mice (n = 2). In slices obtained 
from the PS1 MU6V mutant transgenic mice, long-lasting increase in the synaptic 

30 strength was 30.% more than that obtained from control mice. 
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Although preferred embodiments of the invention have been described 
herein in detail, it will be understood by those skilled in the art that variations may be 
made thereto without departing from the spirit of the invention or the scope of the 
5 appended claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

<i) APPLICANT: ST. GEORGE-HYSLOP, PETER H. 

ROMMENS, JOHANNA M . 
FRASER, PAUL E. 

<ii) TITLE OF INVENTION: NUCLEI C ACIDS AND PROTEINS RELATED TO 
ALZHEIMER'S DISEASE, AND USES THEREFOR 

(iii) NUMBER OF SEQUENCES: 12 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sim & McBurney 

(B) STREET: 330 University Avenue, 6 TK Floor 
(C> CITY: Toronto 

(D) STATE: Ontario 

{ E ) COUNTRY: Canada 

(F) ZIP: MSG 1R7 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C> OPERATING SYSTEM: PC - DOS /MS -DOS 

<D) SOFTWARE: Patentln Release #1.0, Version #1.30 

<vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 27-JAN-1997 

(C) CLASSIFICATION : 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/592,541 

(B) FILING DATE: 26-JAN-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/021,673 

(B) FILING DATE: 05-JUL-1996 

(vii) PRIOR APPLICATION DATA : 

(A) APPLICATION NUMBER: US 60/021,700 

(B) FILING DATE: 12-JUL-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/029,895 

(B) FILING DATE: 08-NOV-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DOCKET • CAN-006PR(d) 

(B) FILING DATE: 02-JAN-1997 

(viii) ATTORNEY /AGENT INFORMATION: 
(A) NAME: RAE, Patricia A . 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (416) 595-1155 

(B) TELEFAX: (416) 595-1163 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1330 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) - FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 145.. 1275 

(D) OTHER INFORMATION: /product' "S5a w 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

AATTCCCAAA TGACCTTTT A TTTC AT AC AG AG AT AC A A AG GCAACTATGT GCAGCAACAA 60 

TCTGATGGGC AGTCCAAACT CTTGGGAGGA AGTAAATTC A TGGTAAATGT CATGATGGCG 120 

GTCGGGAGGG AGGAAGGTGG CAAG ATG GTG TTG GAA AGC ACT ATG GTG TGT 17 1 

Met val Leu Glu Ser Thr Met Val Cys 
1 5 



10 H 



30 



CGG 
Arg 


AAT 
Asn 


GGA 
Gly 


GAC 

*§& 


TTC 
Phe 


TTA 

Leu 


CCC 
Pro 


ACC 
Thr 


AGG 
Arg 
25 


219 


GTC 
Val 


AAC 
Asn 


ATA 
He 
35 


GTT 
Val 


TGT 
Cys 


CAT 
His 


TCA 
Ser 


AAG 
Lys 
40 


ACC 
Thr 


267 
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(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS- 

IS! iv2?™ ; 377 amino ^Cids 
(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

<ii) MOLECULE TYPE: protein 



507 



555 



-97- 

CGC AGC AAC CCT GAG A AC AAC GTG CGC CTT ATC ACA CTG GCT A AT far t i * 

Arg Ser Asn Pro Giu Asn Asn Val Gig Leu lie ¥hr Hu All Asn Asp 315 

in m ^ *b Hu m m 5si m m nl n* ^ 

£55 S?J *SI §15 iM ?« if 5 25S MS ^ KS S?S *SS gj$ 

8 0 8 5 

*is S5s si? a" ai sis si a- 3- as gj as as «s a* n as 
E2 si? m ?is *ri ss ki sij ss as c ss si? as as ai as 

£52 tn SIS 51? S?S Kg Ail t% gg SIS eJS 2?S as S*S ?a ai 

sir as ?n as Asn in as as as as si? $|j as as as 

? C S III SSI 51* asS ?SS g| Ml SS £5* Sap g|j is" ss; is; si; 

ks 51? $a ssi SSI i?? sss ss; "s si a? si as ?is sj; 
k; sss ki sis fg a; as m m ss Sis sis g?j as sg ss 
ss; as m as in a? §i? Asp sss a; si ss? ss; a; as ss 
ns ss ai ss? 51? is? sit as as as ss as fsj as as as 
as ss sss ssi ss si ss si is? si si as as a? ai si 
fit «J -1 as a§ is? a*p ss as as $s si? sss jg 
as as as as gi as sss a; 22? gi ss" as sis as a; si? 
a; as as as as ?n ss ?$; ss si? as sis iss as as ss 
as as jjj as c ?s ss as iss ss as ai ss ss is sa ai 1083 
sis 111 ms iss as sss ss ss as as a g s p t aj iss as 51? si? 



891 



93 9 



1179 



\\l ts ns ns tit it; is; s:$ f» gjs as £:s «« % n «? 
ss; ns as »: ft; sss tn ss »* jg »s §?? is? £*= m J« ».* 
a: ns ss jg if? & jjs ^ j;; m & ^ «»= 

TGAGACTGG A GGGAAAGGGT AGCTGAGTCT GCTT AGGGAC TGCATGGGGG AATTC 



1330 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Val Leu Glu Ser Thr Met Val Cys Val Asp Asn Ser Glu Tyr Met 

1 5 10 15 - 

Arg Asn Gly Asp Phe Leu Pro Thr Arg Leu Gin Ala Gin Gin Asp Ala 

20 25 30 

Val Asn lie Val Cys His Ser Lys Thr Arg Ser Asn Pro Glu Asn Asn 

35 40 45 

Val Gly Leu lie Thr Leu Ala Asn Asp Cys Glu Val Leu Thr Thr Leu 

50 / 55 60 

Thr Pro Asp Thr Gly >Arg lie Leu Ser Lys Leu His Thr Val Gin Pro 

65 / 7 0 7 5 8 0 

Lys Gly Lys lie Thr Phe Cys Thr Gly lie Arg Val Ala His Leu Ala 

6 5 9 0 9 5 

Leu Lys His Arg Gin Gly Lys Asn His Lys Met Arg lie lie Ala Phe 

100 105 110 

Val Gly Ser Pro Val Glu Asp Asn Glu Lys Asp Leu Val Lys Leu Ala 

115 120 125 

Lys Arg Leu Lys Lys Glu Lys Val Asn Val Asp lie lie Asn Phe Gly 

130 135 140 

Glu Glu Gib Val Asn Thr Glu Lys Leu Thr Ala Phe Val Asn Thr Leu 

145 150 155 160 

Asn Gly Lys Asp Gly Thr Gly Ser His Leu Val Thr Val Pro Pro Gly 

165 170 175 

Pro Ser Leu Ala Asp Ala Leu He Ser Ser Pro He Leu Ala Gly Glu 

180 185 190 

Gly Gly Ala Met Leu Gly Leu Gly Ala Ser Asp Phe Glu Phe Gly Val 

195 200 205 

Asp Pro Ser Ala Asp Pro Glu Leu Ala Leu Ala Leu Arg Val Ser Met 

210 I 215 / 220 / 



Glu Glu G * n A r^qT^) / A | g' GO. n \ pi u) {& 1 u ^^/q^ A r 9 ( A r ^ K ± a 0$ £ J J 

/All iSe'c Ala /Ala Glu Ala Gly He Ala Thr Thr Gly Thr Glu Asp Ser 

( ) ; x ./ . ... ^24 5 2 50 2 55 

Asp Asp Ala Leu Leu Lys Met Thr He Ser Gin Gin Glu Phe Gly Arg 
260 265 270 

Thr Gly Leu Pro Asp Leu Ser Ser Met Thr Glu Glu Glu Gin He Ala 
275 280 285 

Tyr Ala Met Gin Met Ser Leu Gin Gly Ala Glu Phe Gly Gin Ala Glu 
290 295 300 

Ser Ala Asp lie Asp Ala Ser Ser Ala Met Asp Thr Ser Glu Pro Ala 
305 310 315 320 

Lys Glu Glu Asp Asp Tyr Asp Val Met Gin Asp Pro Glu Phe Leu Gin 
325 330 335 

Ser Val Leu Glu Asn Leu Pro Gly Val Asp Pro Asn Asn Glu Ala lie- 
340 345 350 

Arg Asn Ala Met Gly Ser Leu Ala Ser Gin Ala Thr Lys Asp Gly Lys 

355 360 , 365 

Lys Asg Lys Lys Glu Glu Asr> Lys I*y 

(2) INFORMATION FOR SEQ ID NO;3: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3841 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

iix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 2 . . 3121 

(ix) FEATURE: 

(A) NAME /KEY : misc feature 
{ B> LOCATION : 1 . . 3F4 1 

(D> OTHER INFORMATION: /note" "GT2 4" 



SUBSTITUTE SHEET (RULE 26) 



WO 97/27296 



PCT/CA97/00051 



, . -99- 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

T 



ISS S5S SIS i C rt Hi m ^ IX gC CGA CST ACG GGC CAG 

- •« §M 8g a? *£S If? SSg asas as SIS as as ES 



as as as g|S as S3 as SSS xaa SSS S5S as as IIS a G SSS 

as sis sg ttf a" as as as as as a: as as as as as 
as as as sis ?ss as s a K s is? as as as a" as as * cg s 
:j; as as as as as as as as a; aj as as as bs sis 
as as as as iff as as as as as as 55s as as as as 
as ia is? as as as as as as sis as as ?*? as 
as as ia as as as as fa si? is? as as sg sis is? as 
as sg as a? 1a as as as a? as as as Irs iss as is? 
as as as as sis gs is? as as as as ?*? as as sis as 

574 



as as i?s as sis as as as is? as as ?$s as s?s as is? 
as as as ?g as as as tS? sis as as as as as sis as 

x:,:> 200 205 

ai a; is? as as as ?$s as as as ai as as sis as as 
as g| as as sis as is? as as as as as as as as ;a 
fg as as a? ?5i as £?s as ai sis a- a! sis as as gf 
as a? as ai sis as as as as gs as as ;$s as a? lis 
as as as isi is? ai a; as a c as as as as as as ?a 
as a? as as as as as as as as as as as lis as as 
as gs ?a as as as as as is? a; ;*s as as as ?a as 
as sis as & in as as ai as as as ;*s as as is? gg 
ai SSI sis as as as as as as as as as as as ai as 
as a: as gs sss as as as gi a* as as as as as sis 

CCG G A A GTG ATT CAG ATG TTG CAG CAC CAG TTT CCC TCG GTC CAG TCT 1150 
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Pro Glu Val lie Gin Met Leu Gin His Gin Phe Pro Ser Val Gin Ser 

370 375 380 

AAC GCG GCA GCC TAC TTG CAA CAC CTC TGT TTT GGA G AC AAC AAA ATT 1196 

Asn Ala Ala Ala Tyr Leu Gin His Leu Cys Phe Gly Asp Asn Lys lie 
385 390 395 

AAA GCC GAG ATA AGG AGA CAA GGA GGC ATC CAG CTC CTG GTG G AC CTG 1246 

Lys Ala Glu He Arg Arg Gin Gly Gly He Gin Leu Leu Val Asp Leu 
400 405 410 415 

TTG GAT CAT CGG ATG ACC G A A GTC CAC CGT AGT GCC TGT GGA GCT CTG 1294 

Leu Asp His Arg Met Thr Glu Val His Arg Ser Ala Cys Gly Ala Leu 
420 425 430 

AGA AAC CTG GTG TAT GGG AAG GCC AAC GAT GAT AAC AAA ATT GCC CTG 1342 

Arg Asn Leu Val Tyr Gly Lys Ala Asn Asp Asp Asn Lys lie Ala Leu 

435 440 445 

AAA AAC TGT GGT GGC ATC CCA GCA CTG GTG AGG TTA CTC CGC AAG ACG 1390 

Lys Asn Cys Gly Gly He Pro Ala Leu Val Arg Leu Leu Arg Lys Thr 

450 455 460 

ACT GAC CTG GAG ATC CGG GAG CTG GTC ACA GGA GTC CTT TGG AAC CTC 1438 

Thr Asp Leu Glu lie Arg Glu Leu Val Thr Gly Val Leu Trp Asn Leu 
465 470 475 

TCC TCA TGC GAT GCA CTC AAA ATG CCA ATC ATC CAG GAT GCC CTA GCA 1486 

Ser Ser Cys Asp Ala Leu Lys Met Pro He He Gin Asp Ala Leu Ala 
480 485 490 495 

GTA CTG ACC AAC GCG GTG ATT ATC CCC CAC TCA GGC TGG GAA A AT TCG 1534 

Val Leu Thr Asn Ala Val He He Pro His Ser Gly Trp Glu Asn Ser 
500 505 510 

CCT CTT CAG GAT GAT CGG AAA ATA CAG CTG CAT TCA TCA CAG GTG CTG 1582 

Pro Leu Gin Asp Asp Arg Lys lie Gin Leu His Ser Ser Gin Val Leu 

515 52 0 52 5 



CGT AAC GCC ACC GGG TGC CTA AGG AAT GTT AGT TCG GCC GGA GAG GAG 1630 

Arg Asn Ala Thr Gly Cys Leu Arg Asn Val Ser Ser Ala Gly Glu Glu 

530 535 540 

GCC CGC AGA AGG ATG AGA GAG TGT GAT GGG CTT ACG GAT GCC TTG CTG 1678 

Ala Arg Arg Arg Met Arg Glu Cys Asp Gly Leu Thr Asp Ala Leu Leu 

545 550 555 

TAC GTG ATC CAG TCT GCG CTG GGG AGC AGT GAG ATC GAT AGC AAG ACC 1726 

Tyr Val He Gin Ser Ala Leu Gly Ser Ser Glu He Asp Ser Lys Thr 

560 565 570 575 



GTT GAA AAC TGT GTG TGC ATT TTA AGG AAC CTC TCG TAC CGG CTG GCG 1774 

Val Glu Asn Cys Val Cys He Leu Arg Asn Leu Ser Tyr Arg Leu Ala 

580 585 590 

GCA GAA ACG TCT CAG GGA CAG CAC ATG GGC ACG GAC GAG CTG GAC GGG 1822 

Ala Glu Thr Ser Gin Gly Gin His Met Gly Thr Asp Glu Leu Asp Gly 

595 600 605 



CTA CTC TGT GGC GAG GCC AAT GGC AAG GAT GCT GAG AGC TCT GGG TGC 1870 
Leu Leu Cys Gly Glu Ala Asn Gly Lys Asp Ala Glu Ser Ser Gly Cys 
610 615 620 



TGG GGC AAG AAG AAG AAG AAA AAG AAA TCC CAA GAT CAG TGG GAT GGA 19 18 

Trp Gly Lys Lys Lys Lys Lys Lys Lys Ser Gin Asp Gin Trp Asp Gly 
625 630 635 



GTA GGA CCT CTT CCA GAC TGT GCT GAA CCA CCA AAA GGG ATC CAG ATG 1966 
Val Gly Pro Leu Pro Asp Cys Ala Glu Pro Pro Lys Gly lie Gin Met 
640 64$ 650 655 



CTG TGG CAC CCA TCA ATA GTC AAA CCC TAC CTC ACA CTG CTC TCT GAG 2014 

Leu Trp His Pro Ser He Val Lys Pro Tyr Leu Thr Leu Leu Ser Glu 

660 665 670 

TGC TCA AAT CCA GAC ACG CTG GAA GGG GCG GCA GGC GCC CTG CAG AAC 2062 

Cys Ser Asn Pro Asp Thr Leu Glu Gly Ala Ala Gly Ala Leu Gin Asn 
675 680 685 

TTG GCT GCA GGG AGC TGG AAG TGG TCA GTA TAT ATC CGA GCC GCT GTC 2110 

Leu Ala Ala Gly Ser Trp Lys Trp Ser Val Tyr He Arg Ala Ala Val 

690 695 ' 70S 

CGA AAA GAG AAA GGC CTG CCC ATC CTC GTG GAG CTG CTC CGA ATA GAC 2158 

Arg Lys Glu Lys Gly Leu Pro He Leu Val Glu Leu Leu Arg lie Asp 

705 710 715 

AAT GAC CGT GTG GTG TGC GCG GTG GCC ACT GCG CTG CGG AAC ATG GCC 2206 

Asn Asp Arg Val Val Cys Ala val Ala Thr Ala Leu Arg Asn Met Ala 

720 725 730 735 

TTG GAC GTC AGA AAT AAG GAG CTC ATC GGC AAA TAC GCC ATG CGA GAC 2254 

Leu Asp Val Arg Asn Lys Glu Leu He Gly Lys Tyr Ala Met Arg Asp 

740 745 750 



SUBSTITUTE SHEET (RULE 26) 



WO 97/27296 



PCT/CA97/00051 



s sis sis j|| g» »» - aslis 1 "re as ;;s tn re re »•• 

7 60 7 65 

a?^ 5 T 5 I CG GAT GAC ACA GTG ACA GCT GTC TGC TGC AC A CTf r»r 
Lys Ala Met Ser Asp Asd Thr Val Thr &i i SI V i„- I?? A F A ? TG ^AC 



£s Thr Leu His 



2350 



2398 



r *° wai ijAt ACA GTG ACA GCT GTC rnr rr 

* "IS »" B " > Tht ;«| *»" °" 5" Iff cj 
IK 51! SR ?S? £« j;s S|| g?S ft fff £5S ||| •» ff° ■;» S j 

IIx «? «" t» j« 5If If? !If is ;»; i|| »; g f ; «« - .... 

5 " Ki HI si? "s sfr if; |f; Its nf as as ssf III 



815 

CAG GTC CTC AAC AGC 
825 Val LeU ASA Ser 

^ SM ?- S|| a; g« MS MI KS ? A S £ AA 2*- a* S?* ??? IS* »« 



840 y yS P fj£ 

Si! ??? HI IB St iff If? gj g* ?S C .jc c f ; g. gc e ? s Cf. 

tff fff «f Iff Iff Iff m « f« |« .,C ,« g, CJJ g, „j 

Hi fff 55f ;;s fff if; as «t ss if; ffi if; s: as g;; s:? 

8 90 895 
KS »S ^ fg S2- - - - gc ACC gjc ACC JJC 

25S ^ ? A s sg i?? a; a? fig sts ;ss sss ss; AA f a; A ?s 

E? T £r If! SM S A S *£I G? A Jg Ser £1 IS JfJ MS SSI ?$I 2 "° 

g " lit ks i aa as |i| 2 AA s?s asS G ?s fi| is* sts ss: 

fH ~° SM SK S3 fff Ar g Lys «J ??r i?S A ~ ??< g? G SS2 gj 
§52 MI $SS ?£5 Ar g Asn l?r Asp »j ||| Ph ~ §*S Mp §g 51? 

S A i S A S a? c g| aL c MS 8M ?J? o ?£i SI? S A f LeS |g s £SS AA 3 »" 

sss ?ss |? c ro A AA s ? a s 5ii as ^s 5 ? A r is a s?s A ?s ||| 0 ^%^i ai 

§ AA ks 5 a aa s ? a i i?s as o „ CA s ?js ip 3 ill Hi ?S P G »» 

Val TGAGGA GCAG GGCACAGGCG CTCCGGGAAA CAGTGCATGT GCATGCATAC 3171 
1040 11 

CAC AAGACAT TTCTTTCTGT TTTGGTTTTT TTCTCCTGCA AATTTAGTTT GTTAAAGCCT 

GTTCCATAGG AAGGCTGTGA TAACCAGTAA GGGAAATATT AAGAGCTATT TTAGAAAGCT 

AAATGAATCG CAAGTTAACT TGGAAATCAG TAGAAAGCTA AAGTGATCCT AAAT ATG AC A 

GTGGGCAGCA CCTTTCCTAG CGTGTTNTGT TAGGAGTAAC GAGAAGTGC T TTATACTGAA 

CGTGGGTTGN TTGGT AGGGT GGAGNCGAGG CATTCGGGCC GGTGGGGCGT AAGGGTTATC 

GTTAAGCACA AGACACAGAA TAGTTTACAC ACTGTGTGGG GGACGGCTTC TCACGCTTTG 

TTTACTCTCT TCATCCGTTG TGACTCTAGG CTTC AGGTTG CATTGGGGTT CCTCTGTACA 

GCAAGATGTT TCTTGCCTTT TGTTAATGCA TTGTTGTAAA GTATTTGATG T ACATTAC AG 

ATTA AAGAAG NAAAGCGCGT TGTGTATATT ACACC AATNC CGCCGTGTTT CCTCATCTAT 

GGTTCT AAAT ATTGCTTCAA TTTCNA ACTT TTGAAAGATG TATGGATTTC C AGTTTTTCT 
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2590 



2638 



2 68 6 



2734 



2782 



2878 



2926 



3231 
3291 
3351 
3411 
34 7 1 
3531 
3591 
3651 
3711 
377 1 
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TT ACTTTCTC CCAGTATGTT TTAACCNMMN A AAA A A AAA A AAAAAAAAAA AAAAAAAAAA 3831 
A AAACTCGAG 3841 

{2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1040 amino acids 
(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Ser Gin Leu Pro Ala Arg Gly Thr Gin Ala Arg Xaa Thr Gly Gin Ser 
1 5 10 15 

Phe Ser Gin Gly Thr Thr Ser Arg Ala Gly His Leu Ala Gly Pro Glu 
20 25 30 

Pro Ala Pro Pro Pro Pro Pro Xaa Pro Arg Glu Pro Phe Ala Pro Ser 
35 40 45 

Leu Gly Ser Ala Phe His Leu Pro Asp Ala Pro Pro Ala Ala Ala Ala 
SO 55 60 

Ala Ala Leu Tyr Tyr Ser Xaa Ser Thr Leu Pro Ala Pro Pro Arg Gly 
65 70 75 80 

Gly Ser Pro Leu Ala Ala Pro Gin Gly Gly Ser Pro Thr Lys Leu Gin 
85 90 95 

Arg Gly Gly Ser Ala Pro Glu Gly Ala Thr Tyr Ala Ala Pro Arg Gly 
100 105 110 

Ser Ser Pro Lys Gin Ser Pro Ser Arg Leu Ala Lys Ser Tyr Ser Thr 
115 120 125 

Ser Ser Pro lie Asn He Val Val Ser Ser Ala Gly Leu Ser Pro He 
130 135 140 

Arg Val Thr Ser Pro Pro Thr Val Gin Ser Thr lie Ser Ser Ser Pro 
145 150 155 160 

He His Gin Leu Ser Ser Thr He Gly Thr Tyr Ala Thr Leu Ser Pro 
165 170 175 

Thr Lys Arg Leu Val His Ala Ser Glu Gin Tyr Ser Lys His Ser Gin 
180 165 190 

Glu Leu Tyr Ala Thr Ala Thr Leu Gin Arg Pro Gly Ser Leu Ala Ala 
1§5 200 " 205 

Gly Ser Arg Ala Ser Tyr Ser Ser Gin His Gly His Leu Gly Pro Glu 
210 215 220 

Leu Arg Ala Leu Gin Ser Pro Glu His His lie Asp Pro He Tyr Glu 
225 230 235 240 

Asp Arg Val Tyr Gin Lys Pro Pro Met Arg Ser Leu Ser Gin Ser Gin 
245 250 255 

Gly Asp Pro Leu Pro Pro Ala His Thr Gly Thr Tyr Arg Thr Ser Thr 
260 265 270 

Ala Pro Ser Ser Pro Gly Val Asp Ser Val Pro Leu Gin Arg Thr Gly 
275 280 285 

Ser Gin His Gly Pro Gin Asn Ala Ala Ala Ala Thr Phe Gin Arg Ala 
2 90 295 30 0 

Ser Tyr Ala Ala Gly Pro Ala Ser Asn Tyr Ala Asp Pro Tyr Arg Gin 
305 310 315 320 

Leu Gin Tyr Cys Pro Ser Val Glu Ser Pro Tyr Ser Lys Ser Gly Pro 
325 330 335 

Ala Leu Pro Pro Glu Gly Thr Leu Ala Arg Ser Pro Ser He Asp Ser 
340 345 350 

He Gin Lys Asp Pro Arg Glu Phe Gly Trp Arg Asp Pro Glu Leu Pro 
3§5 360 365 

Glu Val He Gin Met Leu Gin His Gin Phe Pro Ser Val Gin Ser Asn 
370 375 380 

Ala Ala Ala Tyr Leu Gin His Leu Cys Phe Gly Asp Asn Lys He Lys 
385 390 395 400 

Ala Glu He Arg Arg Gin Gly Gly He Gin Leu Leu Val Asp Leu Leu 
405 410 415 
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Asp His Arg Met Thr Clu Val His Arg Ser Ala Cys Gly Ala Leu Arg 

425 430 

Asn Leu Val Tyr Gly Lys Ala Asn Asp Asp Asn Lys He Ala Leu Lys 
J J 440 445 

Asn Cvs Gly Gly lie Pro Ala Leu Val Arg Leu Leu Arg Lys Thr Thr 
JU 155 460 

AS| Leu Glu He Arg Glu Leu Val Thr Gly Val Leu Trp Asn Leu Ser 



480 



Ser Cys Asp Ala Leu Lys Met Pro lie lie Gin Asp Ala Leu Ala Val 
485 490 495 

Leu Thr Asn Ala Val He lie Pro His Ser Gly Trp Glu Asn Ser Pro 
5U 0 505 5io 

Leu Gin Asp Asp Arg Lys He Gin Leu His Ser Ser Gin Val Leu Arg 
3 13 520 525 

Asn Ala Thr Gly Cys Leu Arg Asn Val Ser Ser Ala Gly Glu Glu Ala 

Arg Arg Arg Met Arg Glu Cys Asp Gly Leu Thr Asp Ala Leu Leu Tyr 

550 555 560 

Val He Gin Ser Ala Leu Gly Ser Ser Glu lie Asp Ser Lys Thr Val 
565 570 575 

Glu Asn Cys Val Cys He Leu Arg Asn Leu Ser Tyr Arg Leu Ala Ala 
*>oU 585 590 

Glu Thr Ser Gin Gly Gin His Met Gly Thr Asp Glu Leu Asp Gly Leu 
535 600 605 

Leu Cys Gly Glu Ala Asn Gly Lys Asp Ala Glu Ser Ser Gly Cys Trp 

6 2 0 

Gly Lys Lys Lys Lys Lys Lys Lys Ser Gin Asp Gin Trp Asp Gly Val 

bJ0 635 640 

Gly Pro Leu Pro Asp Cys Ala Glu Pro Pro Lys Gly He Gin Met Leu 
645 650 655 

Trp His Pro Ser He Val Lys Pro Tyr Leu Thr Leu Leu Ser Glu Cys 
660 665 670 

Ser Asn Pro Asp Thr Leu Glu Gl£ Ala Ala Gly Ala Leu Gin Asn Leu 

Ala Ala Gly Ser Trp Lys Tr| Ser Val Tyr He Arg Ala Ala Val Arg 

Lys Glu Lys Gly Leu Pro He Leu Val Glu Leu Leu Arg He Asp Asn 

/1U 715 720 

Asp Arg Val Val Cys Ala Val Ala Thr Ala Leu Arg Asn Met Ala Leu 
725 730 735 

Asp Val Arg Asn Lys Glu Leu He Glj Lys Tyr Ala Met Arg Asp Leu 

Val His Arg Leu Pro Gly Gly Asn Asn Ser Asn Asn Thr Ala Ser Lys 
' " 7 60 765 

Ala Met Ser Asp Asp Thr Val Thr Ala Val Cys Cgs Thr Leu His Glu 

Val He Thr Lys Asn Met Glu Asn Ala Lys Ala Leu Arg Asp Ala Gly 

90 800 
Gly He Glu Lys Leu Val Gly He Ser Lvs Ser Lys Gly Asp Lys His 

Ser Pro Lys Val val Lys Ala Ala Ser Gin Val Leu Asn Ser Met Trp 

Gin Tyr Ara Asp Leu Arg Ser Leu Tyr Lys Lys Asp Gly Trp Ser Gin 

Tyr His Phe Val Ala Ser Ser Ser Thr He Glu Arg Asp Arg Gin Arg 

Pro Tyr Ser Ser Ser Arc Thr Pro Ser He Ser Pro Val Arg Val Ser 

° ' u 8 7 5 8 8 0 

Pro Asn Asn Arg Ser Ala Ser Ala Pro Ala Ser Pro Arg Glu Met He 

0B5 890 895 

Ser Leu Lys Glu Arg Lys Thr Asp Tyr Glu Cys Thr Gly |er Asn Ala 

Thr Tyr His Gly Gly Lys Gly Glu His Thr Ser Arg Lys Asp Ala Met 
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Thr Ala Gin Asn Thr Gly lie Ser Thr Leu Tyr Arg Asn Ser Tvr Glv 
930 935 940 y 

Ala Pro Ala Glu Asp lie Lys His Asn Gin Val Ser Ala Gin Pro Val 
945 950 955 960 

Pro Gin Glu Pro Ser Arg Lys Asp Tyr Glu Thr Tyr Gin Pro Phe Gin 
965 970 975 

Asn Ser Thr Arg Asn Tyr Asp Glu Ser Phe Phe Glu Asp Gin Val His 
980 985 990 

His Arg Pro Pro Ala Ser Glu Tyr Thr Met His Leu Gly Leu Lys Ser 
995 1000 1005 

Thr Gly Asn Tyr Val Asp Phe Tyr Ser Ala Ala Arg Pro Tyr Ser Glu 
1010 1015 1020 

Leu Asn Tyr Glu Thr Ser His Tyr Pro Ala Ser Pro Asp Ser Trp Val 
1025 1030 1035 H 1040 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3907 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 142. .3777 

(D) OTHER INFORMATION: /note- "p0071 w 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

CTACTGTTGT TTTTGAGGGG CGGGC AGCCG CGCCGCCGCG GCACTTTTTT A ATT TTTTCG 60 

GGTGCCGC AG CAGCGACCCC TCGGCGCCGA TGTCCCTGAT CCCTGGAGCG ACGACGGCCG 120 

CTGCCTAAGC TGGGAAGAGG A ATG CCA GCT CCT GAG CAG GCC TCA TTG GTG 17 1 

Met Pro Ala Pro Glu Gin Ala Ser Leu Val 
15 10 

GAG GAG GGG CAA CCA CAG ACC CGC CAG GAA GCT GCC TCC ACT GGC CCA 219 
Glu Glu Gly Gin Pro Qln Thr Arg Gin Glu Ala Ala Ser Thr Gly Pro 
15 20 25 

GGC ATG GAA CCC GAG ACC ACA GCC ACC ACT ATT CTA GCA TCC GTG AAG 267 
Gly Met Glu Pro Glu Thr Thr Ala Thr Thr lie Leu Ala Ser Val Lys 
30 35 40 

GAG CAG GAG CTT CAG TTT CAG CGA CTC ACC CGA GAA CTG GAA GTG GAA 315 
Glu Gin Glu Leu Gin Phe Gin Arg Leu Thr Arg Glu Leu Glu Val Glu 
4 5 5 0 5 5 

AGG CAG ATT GTT GCC AGT CAG CTA GAA AGA TGT AGG CTT GGA GCA GAA 3 63 

Arg Gin lie Val Ala Ser Gin Leu Glu Arg Cys Arg Leu Gly Ala Glu 
60 65 70 

TCA CCA AGC ATC GCC AGC ACC AGC TCA ACT GAG AAG TCA TTT CCT TGG 411 
Ser Pro Ser lie Ala Ser Thr Ser Ser Thr Glu Lys Ser Phe Pro Trp 
75 80 85 90 

AGA TCA ACA GAC GTG CCA AAT ACT GGT GTA AGC AAA CCT AGA GTT TCT 4 59 

Arg Ser Thr Asp Val Pro Asn Thr Gly Val Ser Lys Pro Arg Val Ser 
95 100 105 

GAC GCT GTC CAG CCC AAC AAC TAT CTC ATC AGG ACA GAG CCA GAA CAA 507 
Asp Ala Val Gin Pro Asn Asn Tyr Leu lie Arg Thr Glu Pro Glu Gin 
110 115 120 

GGA ACC CTC TAT TCA CCA GAA CAG ACA TCT CTC CAT GAA AGT GAG GGA 555 
Gly Thr Leu Tyr Ser Pro Glu Gin Thr Ser Leu His Glu Ser Glu Gly 
125 130 135 

TCA TTG GGT AAC TCA AGA AGT TCA ACA CAA ATG AAT TCT TAT TCC GAC 603 
Ser Leu Gly Asn Ser Arg Ser Ser Thr Gin Met Asn Ser Tyr Ser Asp 
140 145 150 

AGT GGA TAC CAG GAA GCA GGG AGT TTC CAC AAC AGC CAG AAC GTG AGC 651 
Ser Gly Tyr Gin Glu Ala Gly Ser Phe His Asn Ser Gin Asn Val Ser 
15S 160 165 170 

AAG GCA GAC AAC AGA CAG CAG CAT TCA TTC ATA GGA TCA ACT AAC AAC 699 
Lys Ala Asp Asn Arg Gin Gin His Ser Phe lie Gly Ser Thr Asn Asn 
175 180 185 

CAT GTG GTG AGG AAT TCA AGA GCT GAA GGA CAA ACA CTG GTT CAG CCA 747 
His Val Val Arg Asn Ser Arg Ala Glu Gly Gin Thr Leu Val Gin Pro 
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190 195 200 



987 



1035 



1083 



TCA GTA GCC AAT CGG GCC ATG AG A AGA GTT AGT TCA GTT CCA TCT AGA 795 
Ser Val Ala Asn Arg Ala Met Arg Arg Val Ser Sei Val Pro Ser Arc 
205 210 215 

GCA CAG TCT CCT TCT TAT GTT ATC AGC ACA GGC GTG TCT CCT TCA AGG 843 
Aia G i2 Ser Pro Ser T y r Val Ile Ser Thr G1 V v al Ser Pro Ser Arg 
220 225 230 

GGG TCT CTG AGA ACT TCT CTG GGT AGT GGA TTT GGC TCT CCG TCA GTG 891 
Gly ser Leu Arg Thr Ser Leu Gly Ser Gly Phe Gly Ser Pro Ser Val 
235 240 2 4 5 ^ 250 

ACC GAC CCC CGA CCT CTG AAC CCC AGT GCA TAT TCC TCC ACC ACA TTA 939 
Thr Asp Pro Arg Pro Leu Asn Pro Ser Ala Tyr Ser Ser Thr Thr Leu 
255 260 265 

CCT GCT GCA CGG GCA GCC TCT CCG TAC TCA CAG AGA CCC GCC TCC CCA 
Pro Ala Ala Arg Ala Ala Ser Pro Tyr Ser Gin Arg Pro Ala Ser Pro 
270 275 280 

ACA GCT ATA CGG CGG ATT GGG TCA GTC ACC TCC CGG CAG ACC TCC AAT 
Thr Ala lie Arg Arg Ile Gly Ser Val Thr Ser Arg Gin Thr Ser Asn 
285 2 90 295 

AA ° r? A S CA SS C S CT CAA TAC CAA ACC ACC GC C AGA GTG GGG TCC 
Pro Asn Gly Pro Thr Pro Gin Tyr Gin Thr Thr Ala Arg Val Gly Ser 
300 305 310 

CCA CTG ACC CTG ACG GAT GCA CAG ACT CGA GTA GCT TCC CCA TCC CAA 1131 
Pro Leu Thr Leu Thr Asp Ala Gin Thr Arg Val Ala Ser Pro Ser Gin 
JAD 320 325 330 

r A n ST? I CG I CG I CC CCC AAA CGC TCA GGG ATG ACC GCC GTA 1179 

Gly Gin Val Gly Ser Ser Ser Pro Lys Aro Ser Gly Met Thr Ala Val 
335 340 345 

Sro r A n S A I ? TG ?? A £ CT I CA CTG S AA AGG ACT GTT CAT GAC ATG GAG 1227 
Pro Gin His Leu Gly Pro Ser Leu Gin Arg Thr Val His Asp Met Glu 
350 355 

CAA TTC GGA CAG CAG CAG TAT GAC ATT TAT GAG AGG ATG GTT CCA CCC 1275 
Gin Phe Gly Gin Gin Gin Tyr Asp Ile Tyr Glu Arg Met Val Pro Pro 
365 370 375 

AGG CCA GAC AGC CTG ACA GGC TTA CGG AGT TCC TAT GCT AGT CAG CAT 1323 
Arg Pro Asp Ser Leu Thr Gly Leu Arg Ser Ser Tyr Ala Ser Gin His 

AGT CAG CTT GGG CAA GAC CTT CGT TCT GCC GTG TCT CCC GAC TTG CAC mi 
Ser Gin Leu Gly Gin Asp Leu Arg Ser Ala Val Ser Pro tip Leu His 1 

4 °0 405 410 

ill tSI S CT ?T A I AT £ AG GGG AGG ACC TAT TAC ft GC CCA GTG TAC CGC 
He Thr Pro Ile Tyr Glu Gly Arg Thr Tyr Tyr Ser Pro Val Tyr Arg 
415 420 425 

AGC CCA AAC CAT GGA ACT GTG GAG CTC CAA GGA TCG CAG ACG GCC TTr 
Ser Pro Asn His Gly Thr Val Glu Leu Gin Gly Ser Bin Thr Ala llu 
qjQ 435 440 

I AT CGC A F A GGT GTA TCA GGT ATT GGA AAT CTA CAA AGG ACA TCC AGC 
Tyr Arg Thr Gly Val Ser Gly lie Gly Asn LeS Gin Ar| Thr lev Ser 
1*3 450 455 

rt A £ GA AGT 2 CC CTT ACA TAC CAA AGA AAT AAT TAT GCT CTG AAC ACA ISfil 
Gin Arg Ser Thr Leu Thr Tgr Gin Arg Asn Asn Tyr All LeS Asn Thr 1563 

ACA GCT ACC TAC GCG GAG CCC TAC AGG CCT ATA CAA TAC CGA r.Tr ran 
Thr Ala Thr Tyr Ala Glu Pro Tyr Arg Pro lie Gin ?yr Arg vll §1* 

" u 485 490 

as ir, m ;|| ass m C ? G s a i g: sa m m g« «* g« 

? cc W AG $ Iff SS2 I" }K 5}f 5R 2tf 8£ ccc 0 £ GG 8SS 

iU 515 520 

Phi GCC S GG CGT GAT CCT GAG TTG CCT G * G GTC ATT CAC ATG CTT GAG 175 5 

Phe Ala Tid Arg Asp Pro Glu Leu Pro Glu Val lie His Se? ill Glu 

530 535 

1% III Ifo K! Sll HI aSs Asn Ala A?a |fS ?yr Leu §tf S? G 
Iff SJ$ A A p «S Lys G Ix £5« Met i? G vl? 2£ Arg Hu f? G 

GGA ATC AAG CAT CTG GTT GAC CTT CTG GAC CAC AGA GTT TTG GAA GTT 
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Gly lie Lys His Leu Val Asp Leu Leu Asp His Arg Val Leu Glu Val 

575 580 - 585 

CAG AAG AAT GCT TGT GGT GCC CTT CGA AAC CTC GTT TTT GGC AAG TCT 19 4 7 

Gin Lys Asn Ala Cys Gly Ala Leu Arg Asn Leu Val Phe Gly Lys Ser 

590 595 600 

ACA GAT GAA AAT AAA ATA GCA ATG AAG AAT GTT GGT GGG ATA CCT GCC 1995 

Thr Asp Glu Asn Lys lie Ala Met Lys Asn Val Gly Gly lie Pro Ala 

605 610 615 

TTG TTG CGA CTG TTG AGA AAA TCT ATT GAT GCA GAA GTA AGG GAG CTT 204 3 

Leu Leu Arg Leu Leu Arg Lys Ser He Asp Ala Glu Val Arg Glu Leu 

620 625 630 

GTT ACA GGA GTT CTT TGG AAT TTA TCC TCA TGT GAT GCT GTA AAA ATG 2091 

Val Thr Gly Val Leu Trp Asn Leu Ser Ser Cys Asp Ala Val Lys Met 

635 640 645 650 

ACA ATC ATT CGA GAT GCT CTC TCA ACC TTA ACA AAC ACT GTG ATT GTT 2 139 

Thr He He Arg Asp Ala Leu Ser Thr Leu Thr Asn Thr Val He Val 

655 660 665 

CCA CAT TCT GGA TGG AAT AAC TCT TCT TTT GAT GAT GAT CAT AAA ATT 2 187 

Pro His Ser Glv Trp Asn Asn Ser Ser Phe Asp Asp Asp His Lys He 

67 0 67 5 680 

AAA TTT CAG ACT TCA CTA GTT CTG CGT AAC ACG ACA GGT TGC CTA AGG 2235 

Lys Phe Gin Thr Ser Leu Val Leu Arg Asn Thr Thr Gly Cys Leu Arg 

685 690 695 

AAC CTC ACG TCC GCG GGG GAA GAA GCT CGG AAG CAA ATG CGG TCC TGC 2283 

Asn Leu Thr Ser Ala Gly Glu Glu Ala Arg Lys Gin Met Arg Ser Cys 

700 705 710 

GAG GGG CTG GTA GAC TCA CTG TTG TAT GTG ATC CAC ACG TGT GTG AAC 2331 

Glu Gly Leu Val Asp Ser Leu Leu Tyr Val He His Thr Cys Val Asn 

715 720 725 730 

ACA TCC GAT TAC GAC AGC AAG ACG GTG GAG AAC TGC GTG TGC ACC CTG 2379 

Thr Ser Asp Tyr Asp Ser Lys Thr Val Glu Asn Cys Val Cys Thr Leu 

735 740 745 

AGG AAC CTG TCC TAT CGG CTG GAG CTG GAG GTG CCC CAG GCC CGG TTA 2427 

Arg Asn Leu Ser Tyr Arg Leu Glu Leu Glu Val Pro Gin Ala Arg Leu 

750 755 760 

CTG GGA CTG AAC GAA TTG GAT GAC TTA CTA GGA AAA GAG TCT CCC AGC 2475 

Leu Gly Leu Asn Glu Leu Asp Asp Leu Leu Gly Lys Glu Ser Pro Ser 

765 77b 775 

AAA GAC TCT GAG CCA AGT TGC TGG GGG AAG AAG AAG AAA AAG AAA AAG 2523 

Lys Asp Ser Glu Pro Ser Cys Trp Gly Lys Lys Lys Lys Lys Lys Lys 

780 7 85 790 

AGG ACT CCG CAA GAA GAT CAA TGG GAT GGA GTT GGT CCT ATC CCA GGA 257 1 

Arg Thr Pro Gin Glu Asp Gin Trp Asp Gly Val Gly Pro He Pro Gly 

795 800 805 8l5 

CTG TCG AAG TCC CCC AAA GGG GTT GAG ATG CTG TGG CAC CCA TCG GTG 2619 

Leu Ser Lys Ser Pro Lys Gly Val Glu Met Leu Trp His Pro Ser Val 

815 820 825 

GTA AAA CCA TAT CTG ACT CTT CTA GCA GAA AGT TCC AAC CCA GCC ACC 2667 

val Lys Pro Tyr Leu Thr Leu Leu Ala Glu Ser Ser Asn Pro Ala Thr 

830 835 840 

TTG GAA GGC TCT GCA GGG TCT CTC CAG AAC CTC TCT GCT AGC AAC TGG 27 15 

Leu Glu Gly Ser Ala Gly Ser Leu Gin Asn Leu Ser Ala Ser Asn Trp 

B45 850 855 

AAG TTT GCA GCA TAT ATC CGG GGC GGC CGT CCG AAA AGA AAA GGG CTC 2763 

Lys Phe Ala Ala Tyr lie Arg Gly Gly Arg Pro Lys Arg Lys Gly Leu 

860 865 870 

CCC ATC CTT GTG GAG CTT CTG AGA ATG GAT AAC GAT AGA GTT GTT TCT 2811 

Pro He Leu Val Glu Leu Leu Arg Met Asp Asn Asp Arg Val Val Ser 

875 880 885 890 

TCC GGT GCA ACA GCC TTG AGG AAT ATG GCA CTA GAT GTT CGC AAC AAG 28 59 

Ser Gly Ala Thr Ala Leu Arg Asn Met Ala Leu Asp Val Arg Asn Lys 

895 900 905 

GAG CTC ATA GGC AAA TAC GCC ATG CGA GAC CTG GTC AAC CGG CTC CCC 2907 

Glu Leu lie Gly Lys Tyr Ala Met Arg Asp Leu Val Asn Arg Leu Pro 

910 915 920 

GGC GGC AAT GGC CCC AGT GTC TTG TCT GAT GAG ACC ATG GCA GCC ATC 2955 

Gly Gly Asn Gly Pro Ser Val Leu Ser Asp Glu Thr Met Ala Ala He 

925 930 935 

TGC TGT GCT CTG CAC GAG GTC ACC AGC AAA AAC ATG GAG AAC GCA AAA 3003 

Cys Cys Ala Leu His Glu Val Thr Ser Lys Asn Met Glu Asn Ala Lys 

9*0 945 950 
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GAATGAATGA AGTGTGTTTT TTTTTCCTTT TTGAGGAATT ATCAGGGGAA TTCGATATCA 
AGCTTATCGA TACCGTCGAC 

(2) INFORMATION FOR SEQ ID NO: 6: 

<i) SEQUENCE CHARACTERISTICS : 

i52S TH : * 212 anino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE; protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 



Met 
1 


Pro 


Ala 


Pro 


Thr 


Arg 


Gin 


Glu 
20 


Thr 


Ala 


Thr 
35 


Thr 


Gin 


Arg 
50 


Leu 


Thr 



10 



Jer Thr Glv Pro Gly Met Glu Pr 
2S 3 



n Pro Gin 
15 

o Glu Thr 
0 



Le Leu Ala Ser Val Lys Glu Gin Glu Leu Gin Phe 

H 4 5 

:<J Glu Leu Glu Val Glu Arg Gin lie Val Ala Ser 

6 0 



305 1 
3099 
3147 



-107- 

GCC CTG GCC G AC TCA GGA GGC ATA GAG AAG CTG GTG AAC ATA Arr aaa 

Ala Leu Ala Asp Ser Glv Gly He Glu Lys Leu vll Asn lie Th? tys 

965 970 

r?5 AGG 5 GC GAC AGA TCA TCT CTG AAA GTG GTG AAG GCA GCA CCC rnr 
Gly Arg Gly Asp Arc Ser Ser Leu Lys Val vll iyt Ala Ala All G?n 
y ' 5 980 995 

sis r:; «: ?s; us ??? ?m ||| m eis %%% j« 
ts zn fg «s si a; s;s a: a; $n ss.ss si? is7?s; us •»» 

xuiu 1015 

§?s f|f 0 25? us £?; is; ki is; is? us jn ?ss ;ss ss if; 

SEN! IS5 ESS t" \HW IS; SIE 58 SSfsTlSI I" IS; ||| o 

k eis ra if; jiys «s is: ss§ isiJk ?» e* ;?= fjsJsT »»■ 
ss; if: sis sf; o ;j; »• s« sj^a; •« s; fS » sKt;"i?5 
eis ?;? ||| s if5 is; ;ss ssi o is; ?s; ?n {isJsTis? i;? 

? " ?!s." ; °" »s °" ?is s 5 " J " 5 " - fifjii sm sts eis 
$ s ?;i w a: in sjys j;s si; t?? Msjnsj; s: ??? ^ 
C! ?;i a: its inss: a: ss? sgjsss ?;? m t$zi 

ESS 511 SiS JKSK til ISI SSI J^?}? s» »; 5" t^EIS 

ss i" s?j 5 ?s; ;;i ?;? si; gyn s: is? ;s? {ftjsTss sh 
jfj,» as s;s ;;? es; s s?$ is? &. ;«< isjjts'si? ?;? «; 
gys e?s as s;s jgjas is JK ?n ei^sTiis us an j« 

AGG TGA AAAGTCCATC TTGCTGATTT CATGATTGAA ATGTGAAAGT GAAGTGGAAG 



353 1 



3 579 



3 627 



3675 



3723 



3827 

3887 
3907 
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Gin Leu Glu Arc Cys Arg Leu Gly Ala Glu Ser Pro ser lie Ala Ser 
65 70 75 80 

Thr Ser Ser Thr Glu Lys Ser Phe Pro Trp Arg Ser Thr Asp Val Pro 
8 5 90 95 

Asn Thr Gly Val Ser Lys Pro Arg Val Ser Asp Ala Val Gin Pro Asn 
100 105 110 

Asn Tyr Leu lie Arg Thr Glu Pro Glu Gin Gly Thr Leu Tyr Ser Pro 
115 120 125 

Glu Gin Thr Ser Leu His Glu Ser Glu Gly Ser Leu Gly Asn Ser Arg 
130 135 140 

Ser Ser Thr Gin Met Asn Ser Tyr Ser Asp Ser Gly Tyr Gin Glu Ala 
145 150 155 160 

Gly Ser Phe His Asn Ser Gin Asn Val Ser Lys Ala Asp Asn Arg Gin 
165 170 175 

Gin His Ser Phe He Gly Ser Thr Asn Asn His Val Val Arg Asn Ser 
180 185 190 

Arg Ala Glu Gly Gin Thr Leu Val Gin Pro Ser Val Ala Asn Arg Ala 
195 200 205 

Met Arg Arg Val Ser Ser Val Pro Ser Arg Ala Gin Ser Pro Ser Tyr 
210 215 220 

Val He Ser Thr Gly Val Ser Pro Ser Arg Gly Ser Leu Arg Thr Ser 
225 230 235 240 

Leu Gly Ser Gly Phe Gly Ser Pro Ser Val Thr Asp Pro Arg Pro Leu 
245 250 255 

Asn Pro Ser Ala Tyr Ser Ser Thr Thr Leu Pro Ala Ala Arg Ala Ala 
260 265 270 

Ser Pro Tyr Ser Gin Arg Pro Ala Ser Pro Thr Ala lie Arg Arg He 
275 280 285 

Gly Ser val Thr Ser Arg Gin Thr Ser Asn Pro Asn Gly Pro Thr Pro 
290 295 300 

Gin Tyr Gin Thr Thr Ala Arg Val Gly Ser Pro Leu Thr Leu Thr Asp 
305 310 315 320 

Ala Gin Thr Arg Val Ala Ser Pro Ser Gin Gly Gin Val Gly Ser Ser 
325 330 335 

Ser Pro Lys Arg Ser Gly Met Thr Ala Val Pro Gin His Leu Gly Pro 
340 345 350 

Ser Leu Gin Arg Thr Val His Asp Met Glu Gin Phe Gly Gin Gin Gin 
355 360 365 

Tyr Asp lie Tyr Glu Arg Met Val Pro Pro Arg Pro Asp Ser Leu Thr 
* 37b 375 380 

Gly Leu Arg Ser Ser Tyr Ala Ser Gin His Ser Gin Leu Gly Gin Asp 
385 390 395 400 

Leu Arg Ser Ala Val Ser Pro Asp Leu His He Thr Pro He Tyr Glu 
405 410 415 

Gly Arg Thr Tyr Tyr Ser Pro Val Tyr Arg Ser Pro Asn His Gly Thr 
42 0 42 5 4 30 

Val Glu Leu Gin Gly Ser Gin Thr Ala Leu Tyr Arg Thr Gly Val Ser 
435 440 445 

Gly He Gly Asn Leu Gin Arg Thr Ser Ser Gin Arg Ser Thr Leu Thr 
450 455 460 

Tyr Gin Arg Asn Asn Tyr Ala Leu Asn Thr Thr Ala Thr Tyr Ala Glu 
4*55 470 475 4 80 

Pro Tyr Arg Pro He Gin Tyr Arg Val Gin Glu Cys Asn Tyr Asn Arg 
485 4 90 495 

Leu Gin His Ala Val Pro Ala Asp Asp Gly Thr Thr Arg Ser Pro Ser 
500 505 510 

He Asp Ser He Gin Lys Asp Pro Arg Glu Phe Ala Trp Arg Asp Pro 
515 52 0 52 5 

Glu Leu Pro Glu Val He His Met Leu Glu His Gin Phe Pro Ser Val 
530 535 540 

Gin Ala Asn Ala Ala Ala Tyr Leu Gin His Leu Cys Phe Gly Asp Asn 
545 550 555 560 

Lys Val Lys Met Glu Val Cys Arg Leu Gly Gly He Lys His Leu Val 
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565 570 575 

Asp Leu Leu Asp His Arg Val Leu Glu Val Gin Lys Asn Ala Cys Gly 
580 585 ' 590 

Ala Leu Arg Asn Leu Val Phe Gly Lys Ser Thr Asp Glu Asn Lys lie 
595 600 605 

Ala Met Lys Asn Val Gly Gly lie Pro Ala Leu Leu Arg Leu Leu Arg 
610 615 62 0 

Lys Ser lie Asp Ala Glu Val Arg Glu Leu Val Thr Gly Val Leu Trp 
625 630 635 640 

Asn Leu Ser Ser Cys Asp Ala Val Lys Met Thr lie lie Arg Asp Ala 
645 650 655 

Leu Ser Thr Leu Thr Asn Thr Val lie Val Pro His Ser Gly Trp Asn 
660 665 670 

Asn Ser Ser Phe Asp Asp Asp His Lys He Lys Phe Gin Thr Ser Leu 
675 680 685 

Val Leu Arg Asn Thr Thr Gly Cys Leu Arg Asn Leu Thr Ser Ala Gly 
690 695 700 

Glu Glu Ala Arg Lys Gin Met Arg Ser Cys Glu Gly Leu Val Asp Ser 
705 710 715 720 

Leu Leu Tyr Val lie His Thr Cys Val Asn Thr Ser Asp Tyr Asp Ser 
'25 730 735 

Lys Thr Val Glu Asn Cys Val Cys Thr Leu Arg Asn Leu Ser Tyr Arg 
7 4 0 7 4 5 7 5 0 

Leu Glu Leu Glu Val Pro Gin Ala Arg Leu Leu Gly Leu Asn Glu Leu 
755 760 765 

Asp Asp Leu Leu Gly Lys Glu Ser Pro Ser Lys Asp Ser Glu Pro Ser 
770 775 780 

Cvs Trp Gly Lys Lys Ljjs Lys Lys Lys Lys Arc Thr Pro Gin Glu Asg 

Gin Trp Asp Gly Val Gly Pro He Pro Gly Leu Ser Lys Ser Pro Lys 
305 810 815 

Gly Val Glu Met Leu Trp His Pro Ser Val Val Lys Pro Tyr Leu Thr 
820 825 830 

Leu Leu Ala Glu Ser Ser Asn Pro Ala Thr Leu Glu Gly Ser Ala Gly 
835 840 845 

Ser Leu Gin Asn Leu Ser Ala Ser Asn Trp Lys Phe Ala Ala Tyr He 
850 855 860 

Arg Gly Gly Arg Pro Lys Arg Lys Gly Leu Pro He Leu Val Glu Leu 
003 870 875 880 

Leu Arg Met Asp Asn Asp Arg Val Val Ser Ser Gly Ala Thr Ala Leu 
885 890 895 

Arg Asn Met Ala Leu Asp Val Arg Asn Lys Glu Leu He Gly Lys Tyr 
900 905 916 

Ala Met Ar| Asp Leu Val Asn Arg Leu Pro Gly Gly Asn Gly Pro Ser 

Val Leu Ser Asp Glu Thr Met Ala Ala lie Cys Cys Ala Leu His Glu 
3JU 935 940 

val Thr Ser Lys Asn Met Glu Asn Ala Lys Ala Leu Ala Asp Ser Gly 

950 955 960 

Gly He Glu Lys Leu Val Asn He Thr Lys Gly Arg Gly Asp Arg Ser 
965 970 975 

Ser Leu Lys val Val Lys Ala Ala Ala Gin Val Leu Asn Thr Leu Trp 
980 985 990 

Gin Tyr Ar^ Asp Leu Arg Ser JJj^Tyr Lys Lys Asp Gly^Trp Asn Gin 

Asn HiS Q Phe He Thr Pro Valuer Thr Leu Glu Ar^Asp Arg Phe Lys 

Ser His Pro Ser Leu Ser Thr Thr Asn Gin Gin Met Ser Pro He He 

AUJU 1035 1040 

Gin Ser Val Gly Ser^Thr Ser Ser Ser Jgg^Ala Leu Leu Gly Jl| Arg 

Asp Pro Arg Ser Glu Tyr Asp Arg Thr Gin Pro Pro Met Gin Tyr Tyr 
lut>0 1065 1070 
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Asn Ser GXn Gly Asp Ala Thr His Lys Gly Leu Tyr Pro Gly Ser Ser 
1U7S 1080 1085 

LyS fS?0 Ser Pr ° 116 Tyr J?5i^ Ser Set Tyr Ser Ser Pro A1 * Arg Glu 
1090 1095 1100 

CUn^Asn Arg Arg Leu Glr^His Gin Gin Leu T^cjyr Ser Gin Asp Asp 

Ser Asn Arg Lys Asn Phe Asp Ala Tyr Arq Leu Tyr Leu Gin Ser P 
1125 1130 1135 

His Ser Tyr Glu Asp Pro Tyr Phe Asp Asp Arg Val His Phe Pro Ala 
1140 1145 1150 

Ser Thr Asp Tyr Ser Thr Gin Tyr Gly Leu Lys Ser Thr Thr Asn Tyr 
1155 1160 1165 

Val l!?0 Phe l£?5 Ar9 Pr ° Ser Tyr ii§o Aid G1U Tyr 

Pro Gly Ser Pro Asp Ser Trp Val Tyr Asp Gin Asp Ala Gin Gin Arg 
1100 1190 1195 120 

Asn Ser Phe Phe Leu Thr Leu Phe Arq Leu Arg 
1205 1210 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 970 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY; linear 



(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION : 1 . . 970 

(D) OTHER INFORMATION: /note- "Y2H9" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



GAATTCCCAC 


AGAT ACCACT 


GCTGCTCCCG 


CCCTTTCGCT 


CCTCGGCCGC 


GCAATGGGCA 


60 


CCCGCGACGA 


CGAGT ACGAC 


TACCTCTTTA 


AAGTTGTCCT 


T ATTGGAG AT 


TCTGGTGTTG 


120 


GAAAGAGTAA 


TCTCCTGTCT 


CGATTTACTC 


GAAATGAGTT 


TAATCTGGAA 


AGCAAGAGCA 


180 


CCATTGGAGT 


AGAGTTTGC A 


ACAAGAAGC A 


TCCAGGTTGA 


TGGAAAAACA 


ATAAAGGCAC 


240 


AGATATGGGA 


CACAGCAGGG 


C AAGAGCG AT 


ATCGAGCTAT 


AACATCAGCA 


TATTATCGTG 


300 


GAGCTGTAGG 


TGCCTT ATTG 


GTTT ATGAC A 


TTGCTAAACA 


TCTCAC AT AT 


GAAAATGTAG 


360 


AGCGATGGCT 


GAAAGAACTG 


AGAGATCATG 


CTGAT AGTAA 


CATTGTTATC 


ATGCTTGTGG 


420 


GCAATAAGAG 


TGATCT ACGT 


CATCTCAGGG 


CAGTTCCTAC 


AGATGAAGCA 


AGAGCTTTTG 


480 


CAGAAAAGAA 


TGGTTTGTCA 


TTCATTGAAA 


CTTCGGCCCT 


AGACTCTACA 


AATGTAGAAG 


540 


CTGCTTTTC A 


GAC AATTTTA 


ACAGAGATTT 


ACCGCATTGT 


TTCTC AG AAG 


CAAATGTCAG 


600 


AC AGACGCGA 


AAATGACATG 


TCTCCAAGCA 


ACAATGTGGT 


TCCTATTC AT 


GTTCCACCAA 


660 


CCACTGAAAA 


CAAGCC AA AG 


GTGC AGTGCT 


GTCAGAACAT 


CTAAGG CATT 


TCTCTTCTCC 


720 


CCTAGAAGGC 


TGTGTAT AGT 


CCATTTCCCA 


GGTCTSASAT 


TT A A AT AT AW 


TTGTAATTCT 


780 


TGTGTCAC-TT 


TTGTGTTTTA 


TTACTTCATA 


CTTATGAATT 


TTTCCATGTC 


CTAAGTCTTT 


840 


TGATTTTGMT 


TT AT AAAATC 


ATCC ACTTGT 


NCCGAATGNC 


TGC AGCTTTT 


TTTC ATGCTA 


900 


TGGCTTCACT 


AGCCTT AGTT 


TNAT AAACTG 


AATGTTTGGA 


TTCCTCCCCC 


CAAAAAAAAA 


960 


AAAACTCGAG 












970 


(2) INFORMATION FOR SEQ ID NO: 8: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: misc feature 

( B) LOCATION : 1 . . 254 

(D) OTHER INFORMATION: /note- "Y2H23b" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10; 

GAATTCGCGG TCGCGTCGAC GGTTAGTCCC ACTGGNCGC A TCGAGGGNTT CACCAACGTC 

ATGGAGCTGT ATGGCANGAT CGCCGAGGTC TTCCNCCTGC C AACTGCCGA GGTGATGTTC 

TGCACCCTGA NCACCCACAA AGTGGACATN GACAAGCTCC TGGGGGGCCA G ATCGGGCTG 

GAGGACTTC A TCTTCGCCCA CGTGAAGGGG YAGCGCAAGG AGGTGGAGGT GTTCAWGTCG 

GAGGATG YAC TCGGKCTC AC CATCACGGAC AACGGGGCTG GCTACGCTTC C ATCAAGCGC 

ATC A AGGAGG GCAGCGTGAT CGACCACATC CACCTCATCA GCGTGGGCGA CATGATCGAG 
GCCATT AACG GGCAGAGCTT CCTGGGCTGC CGGCATT ACG AGGT 
(2) INFORMATION FOR SEQ ID NO : 1 1 : 

li) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 350 base pairs 

(B) TYPE: nucleic aciS 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: nisc feature 

(B) LOCATION: 1 . . 350 

<D> OTHER INFORMATION: /note- "Y2H171" 
(Xi) SEQUENCE DESCRIPTION: SEQ id NO:ll: 
GAATTCGCGG CCGCGTCGAC AAAAAAAGTA AAAGG AACTC GGC AAATCTT ACCCCGCCTG 
TTTACC AAAA ACATCACCTC TAGCATC ACC AGTATTAG AG GCACCGCCTG CCC AGTGACA 



60 
120 
160 
240 
264 



(xi) SEQUENCE DESCRIPTION: SEQ ID^NO:6: 

GAATTCGCGG CCGNGTCGAC CCCCCACCCC CGATGCCACC ACCCCCANTG GGNTCTCCCN 

NCCCAGTC AT CAGTTCTTCC ATGGNGTNCC CTGGTCTGCC CCCTCCAGCT CCCCCAGGCN 

TTCTCCGGGT CTGNC AGC AG CCNCC AGATT AACTCAACAG TGTCACTCCC TGGGGGTGGG 

TCTGGNCCCC CTGANG ATGT GAAGCCACCA GTCTNAGNGG TCCGGGGTCT GTACTGTCC A 
CCCCCTCCAG GTGGACC TGG CGCT 
(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/ KEY : misc feature 

(B) LOCATION: 1 . .370 

(D> OTHER INFORMATION: /note- "Y2H27" 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
GAATTCGCGG CCGCGTCGAC CGCGGTCGCG TCGACCTGTT GCCCAGGCCC TAGAGGTCAT 
TCCTCGTACC CTGATCC AGA ACTGTGGGGC CAGCACCATC CGTCTACTTA CCTCCCTTCG 
GGCCAAGCAC ACCCAGGAGA ACTGTGAGAC CTGGGGTGT A AATGGTGAGA CGGGTACTTT 180 
GGTGGAC ATG A AGG AACTGG GCATATGGGA GCCATTGGCT GTGAAGCTGC AGACTT ATAA 240 
GACAGCAGTG GAGACGGCAG TTCTGCTACT GCGAATTGAT GAC ATCGTTT CAGGCC ACAA 
AAAG AAAGGC GATGACCAGA GCCGGCAAGG CGGNGCTCCT 
(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

( B ) LOCATION : 1 . . 4U4 

(D) OTHER INFORMATION: /note- *' Y 2 K 3 5 H 



60 
120 



300 
340 



60 
120 
180 
240 
300 
360 
404 



60 
120 
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CATGTTTAAC 



GGCCGCGGT A CCCTAACCGT GC AAAGGT AG C AT AATCACT 



TGTTCCTTAA 



180 



GTAGGGACCT 



GTATG AATGG CTCCACGAGG GTTC AGCTGT CTCTTACTTT 



TAACCARTGA 



240 



AATTGACCTG 



CCCGTGAAG A GGCGGGC ATG AC AC AGCAAG ACG AGAAGAC 



CCTATGGAGC 



300 



TTTAATTTAT 



TAATGC AAAC AGTACCTAAC AAACCCAC AG GGTCCT AAAC 



350 



(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/ KEY: misc feature 

( B ) LOCATION: 1 . . 350 

(D) OTHER INFORMATION: /note- "Y2H41" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 



GAATTCGCGG 


NCGCGTCGAC 


AGATAATGAA 


AAAACC AGAG 


GTTCCCTTCT 


TTGGTCCCCT 


60 


NNNNGATGGT 


GCTATTGTGA 


ATGG AAAGGT 


TCTACCC ATT 


ATGGTTAGAG 


CAACAGCTAT 


120 


A AATGC AAGC 


CGTGCTCTGA 


AATCTCTGAT 


TCC ATTGTAT 


C A AAACTTCT 


ATGAGGAGAG 


180 


AGC ACGAT AC 


CTGC AAACAA 


TTGTCC AGCA 


CCACTTAGAA 


CCAACAACAT 


TTG AAGATTT 


240 


TGN AGCAC AG 


GTTTTTTCTC 


CAGCTCCCTA 


CCACCATTT A 


CCATCTGATG 


CCGTTGGCTC 


300 


CTACCCAGAG 


ATTCTACCCA 


GTGAAAACTC 


CCACAGCAAC 


GCAGGT AGGA 




350 
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CLAIMS 

What is claimed is: 

1 • An isolated nucleic acid comprising a nucleotide sequence encoding at 

least a presenilin-interacting domain of a presenilin-interacting protein selected from 
the group consisting of a mammalian SSa^proximately residues 70-377 of SEQ ID 
NO: 2y<jT24 (approximately residues 346-862 of SEQ ED NO: 4), p0071 
5 (approximately residues 509-1022 of SEQ ID NO: 6), Rabl 1 (SEQ ED NO: 7), 
retinoid X receptor-p (SEQ ED NO:8), cytoplasmic chaperonin (SEQ ED NO: 9), 
Y2H35 (SEQ ID NO: 10), Y2H171 (SEQ ED NO: 1 1), and a Y2H41 (SEQ ED NO: 
12) presenilin-interacting domain. 

0 2. An isolated nucleic acid comprising a nucleotide sequence of at least 1 0 

consecutive nucleotides selected from the group consisting of SEQ ED NO: 1, SEQ ED 
NO: 3, SEQ ID NO: 5, SEQ ED NO: 7, SEQ ED NO: 8, SEQ ED NO: 9, SEQ ED NO: 
10, SEQ ED NO: 1 1, SEQ ED NO: 12, GenBank Accession Numbers F08730, T18858, 
X81889, X56740, X53143, M84820, X63522, M81766, U17104, X74801, R12984, 

5 D55326, and T64843, and a sequence complementary to any of these sequences. 



3. An isolated nucleic acid as in claim 2 comprising a nucleotide sequence of 

at least 15 consecutive nucleotides selected from said group. 



4. An isolated nucleic acid as in claim 2 comprising a nucleotide sequence of 

at least 20 consecutive nucleotides selected from said group. 
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5. An isolated nucleic acid comprising a nucleotide sequence encoding an 
antigenic determinant of a presenilin-interacting protein selected from the group 
consisting of a mammalian S5a, GT24, p0071, Rabl 1, retinoid X receptor-p, 
cytoplasmic chaperonin, Y2H35, Y2H171, and Y2H41 protein. 

6. A method for identifying allelic variants or heterospecific homologues of a 
human presenilin-interacting protein gene comprising 

choosing a nucleic acid probe or primer capable of hybridizing to a human 
presenilin-interacting protein gene sequence under stringent hybridization conditions; 

mixing said probe or primer with a sample of nucleic acids which may 
contain a nucleic acid corresponding to said variant or homologue; 

detecting hybridization of said probe or primer to said nucleic acid 
corresponding to said variant or homologue. 

7. A method as in claim 6 wherein said sample comprises a sample of nucleic 
acids selected from the group consisting of human genomic DNA, human mRNA, and 
human cDNA. 

8. A method as in claim 6 wherein said sample comprises a sample of nucleic 
acids selected from the group consisting of mammalian genomic DNA, mammalian 
mRNA, and mammalian cDNA. 

9. A method as in claim 6 wherein said sample comprises a sample of nucleic 
acids selected from the group consisting of invertebrate genomic DNA, invertebrate 
mRNA, and invertebrate cDNA. 
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1 0. A method as in claim 6 further comprising the step of isolating said nucleic 

acid corresponding to said variant or homologue. 



5 11. A method as in claim 6 wherein said nucleic acid is identified by 

hybridization. 



12. A method as in claim 6 wherein said nucleic acid is identified by PCR 

amplification. 



10 



13. A method for identifying allelic variants or heterospecific homologues of a 

human presenilin-interacting protein gene comprising 

choosing an antibody capable of selectively binding to a human 
presenilin-interacting protein; 
15 mixing said antibody with a sample of proteins which may contain a 

protein corresponding to said variant or homologue; 

detecting binding of said antibody to said protein corresponding to said 
variant or homologue. 



14. A method as in claim 1 3 wherein said sample comprises a sample of 

proteins selected from the group consisting of human proteins, human fusion proteins, 
and proteolytic fragments thereof. 
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15. A method as in claim 13 wherein said sample comprises a sample of 

proteins selected from the group consisting of mammalian proteins, mammalian 
fusion proteins, and proteolytic fragments thereof. 

5 16. A method as in claim 1 3 wherein said sample comprises a sample of 

proteins selected from the group consisting of invertebrate proteins, invertebrate 
fusion proteins, and proteolytic fragments thereof. 

1 7. A method as in claim 13 further comprising the step of substantially 
10 purifying said protein corresponding to said variant or homologue. 

1 8. An isolated nucleic acid comprising an allelic variant or a heterospecific 
homologue of a human presenilin-interacting protein gene. 

15 1 9. An isolated nucleic acid encoding an allelic variant or heterospecific 

homologue of a human presenilin-interacting protein. 

20. An isolated nucleic acid comprising a recombinant vector including a 
nucleotide sequence of any one of claims 1-19. 

20 

21 . An isolated nucleic acid as in claim 20 wherein said vector is an 
expression vector and said presenilin-interacting protein nucleotide sequence is 
operably joined to a regulatory region. 
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22. An isolated nucleic acid as in claim 2 1 wherein said expression vector may 

express said presenilin-interacting protein sequence in mammalian cells. 



23 . An isolated nucleic acid as in claim 22 wherein said cells are selected from 

the group consisting of fibroblast, liver, kidney, spleen, bone marrow and neurological 



cells. 



24. An isolated nucleic acid as in claim 2 1 wherein said vector is selected from 

10 the group consisting of vaccinia virus, adenovirus, retrovirus, neurotropic viruses and 
Herpes simplex. 



25. An isolated nucleic acid as in claim 2 1 wherein said expression vector 

encodes at least a presenilin-interacting domain of a presenilin-interacting protein 
selected from the group consisting of a mammalian S5a, GT24, p0071, Rabl 1, 
retinoid X receptor-3, cytoplasmic chaperonin, Y2H35, Y2H171, and Y2H41 protein. 



26. An isolated nucleic acid as in claim 2 1 wherein said vector further 

comprises sequences encoding an exogenous protein operably joined to said 
presenilin-interacting protein sequence and whereby said vector encodes a presenilin- 
interacting protein fusion protein. 



27. An isolated nucleic acid as in claim 26 wherein said exogenous protein is 

selected from the group consisting of lacZ, trpE, maltose-binding protein, a poly-His 
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tag, giutathione-S-transferase, a GAL4-DNA binding domain, and a GAL4 activation 
domain. 

28. An isolated nucleic acid comprising a recombinant expression vector 

5 including nucleotide sequences corresponding to an endogenous regulatory region of a 
presenilin-interacting protein gene. 

29. An isolated nucleic acid as in claim 28 wherein said endogenous 
regulatory region is operably joined to a marker gene. 

10 

30. A host cell transformed with an expression vector of any one of claims 20- 
29, or a descendant thereof. 

31. A host cell as in claim 30 wherein said host cell is selected from the group 
15 consisting of bacterial cells and yeast cells. 

32. A host cell as in claim 30 wherein said host cell is selected from the group 
consisting of fetal cells, embryonic stem cells, zygotes, gametes, and germ line cells. 

20 33. A host cell as in claim 30 wherein said cell is selected from the group 

consisting of fibroblast, liver, kidney, spleen, bone marrow and neurological cells. 
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A host cell as in claim 30 wherein said cell is an invertebrate cell. 



35. A non-human animal model for Alzheimer's Disease, wherein a genome of 

said animal, or an ancestor thereof, has been modified by at least one recombinant 
5 construct, and wherein said recombinant construct has introduced a modification 
selected from the group consisting of (1) insertion of nucleotide sequences encoding 
at least a functional domain of a heterospecific normal presenilin-interacting protein, 
(2) insertion of nucleotide sequences encoding at least a functional domain of a 
heterospecific mutant presenilin-interacting protein, (3) insertion of nucleotide 
10 sequences encoding at least a functional domain of a conspecific homologue of a 
heterospecific mutant presenilin-interacting protein, and (4) inactivation of an 
endogenous presenilin-interacting protein gene. 



36. An animal as in claim 35 wherein said modification is insertion of a 

nucleotide sequence encoding at least a functional domain of a normal human 
presenilin-interacting protein selected from the group consisting of a mammalian S5a, 
GT24, p0071, Rabll, retinoid X receptor-P, cytoplasmic chaperonin, Y2H35, 
Y2H171, and Y2H41 protein. 



37. An animal as in claim 35 wherein said modification is insertion of a 

nucleotide sequence encoding at least a functional domain of a mutant human 
presenilin-interacting protein selected from the group consisting of a mammalian S5a, 
GT24, p007I , Rabl 1, retinoid X receptor-p, cytoplasmic chaperonin, Y2H35, 
Y2H171, and Y2H41 protein. 
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38. An animal as in claim 35 wherein said animal is selected from the group 

consisting of rats, mice, hamsters, guinea pigs, rabbits, dogs, cats, goats, sheep, pigs, 
and non-human primates. 

5 39. An animal as in claim 35 wherein said animal is an invertebrate. 

40. A method for producing at least a functional domain of a presenilin- 
interacting protein comprising culturing a host cell of any of claims 30-34 under 
suitable conditions to produce said presenilin by expressing said nucleic acid. 

10 

41 . A substantially pure preparation of a protein selected from the group 
consisting of a mammalian S5a, GT24, p0071, Rabl 1, retinoid X receptor-p, 
cytoplasmic chaperonin, Y2H35, Y2H171, and Y2H41 protein. 

15 42. A substantially pure preparation of a polypeptide comprising an amino 

acid sequence of at least 10 consecutive amino acid residues selected from the group 
consisting SEQ ED NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, and GenBank Accession 
Numbers F08730, T18858, X81889, X56740, X53143, M84820, X63522, M81766, 
U17104, X74801, R12984, D55326, and T64843. 

20 

43. A substantially pure preparation of a polypeptide as in claim 42 

comprising an amino acid sequence of at least 15 consecutive amino acid residues 
selected from said group. 
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44. A substantially pure preparation of a polypeptide comprising at least a 

presenilin-interacting domain of a presenilin-interacting protein selected from the 
group consisting of a mammalian S5a, GT24, p0071, Rabl 1, retinoid X receptor-p, 
5 cytoplasmic chaperonin, Y2H35, Y2H 1 71 , and Y2H4 1 protein. 



45. A substantially pure preparation of a polypeptide comprising an antigenic 

determinant of a presenilin-interacting protein selected from the group consisting of a 
mammalian S5a, GT24, P 0071, Rabl 1, retinoid X receptor-P, cytoplasmic 
10 chaperonin, Y2H35, Y2H 1 7 1 , and Y2H41 protein. 



46. A method of producing antibodies which selectively bind to a presenilin 

interacting protein comprising the steps of 

administering an immunogenically effective amount of a presenilin- 
15 interacting protein immunogen to an animal; 

allowing said animal to produce antibodies to said immunogen; and 
obtaining said antibodies from said animal or from a cell culture derived 

therefrom. 



20 47. A substantially pure preparation of an antibody which selectively binds to 

an antigenic determinant of a presenilin-interacting protein selected from the group 
consisting of a mammalian S5a, GT24, p0071, Rabl 1, retinoid X receptor-p, 
cytoplasmic chaperonin, Y2H35. Y2H171, and Y2H41 protein. 
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48. A substantially pure preparation of an antibody as in claim 47 wherein said 

antibody selectively binds to an antigenic determinant of a mutant presenilin- 
interacting protein and fails to bind to a normal presenilin-interacting protein. 

5 49. A cell line producing an antibody of any one of claims 47-48. 

50. A method for identifying compounds which can modulate the expression 
of a presenilin-interacting protein gene comprising 

contacting a cell with a test candidate wherein said cell includes a 
10 regulatory region of a presenilin-interacting protein gene operably joined to a coding 
region; and 

detecting a change in expression of said coding region. 

51. A method as in claim 50 wherein said change comprises a change in a 
15 level of an mRNA transcript encoded by said coding region. 

52. A method as in claim 50 wherein said change comprises a change in a 
level of a protein encoded by said coding region. 

20 53. A method as in claim 50 wherein said change is a result of an activity of a 

protein encoded by said coding region. 
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54. A method as in claim 50 wherein said coding region encodes a marker 

protein selected from the group consisting of p-galactosidase, alkaline phosphatase, 
green fluorescent protein, and luciferase. 



5 55. A method for identifying compounds which can selectively bind to a 

presenilin-interacting protein comprising the steps of 

providing a preparation including at least one presenilin-interacting protein 
component; 

contacting said preparation with a sample including at least one candidate 
10 compound; and 

detecting binding of said presenilin-interacting protein component to said 
candidate compound. 



56. The method in 55 wherein said binding to said presenilin-interacting 

component is detected by an assay selected from the group consisting of: affinity 
chromatography, co-immunoprecipitation, a Biomolecular Interaction Assay, and 
yeast two-hybrid system. 



57. A method of identifying compounds which can modulate activity of : 

20 presenilin-interacting protein comprising the steps of 

providing a cell expressing a normal or mutant presenilin-interacting 
protein gene; 

contacting said cell with at least one candidate compound; and 
detecting a change in a marker of said activity. 
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58. A method as in claim 57 wherein measurement of said marker indicates a 
difference between cells bearing an expressed mutant presenilin-interacting protein 
gene and otherwise identical cells free of an expressed mutant presenilin-interacting 
protein gene. 

5 

59. A method as in claim 57 wherein said change comprises a change in a non- 
specific marker of cell physiology selected from the group consisting of pH; 
intracellular Ca*\ Na*, or K*; cyclic AMP levels; GTP/GDP ratios; 
phosphatidylinositol activity; and protein phosphorylation. 

10 

60. A method as in claim 57 wherein said change comprises a change in 
expression of said presenilin-interacting protein. 



61 . A method as in claim 57 wherein said change comprises a change in 

15 intracellular concentration or flux of an ion selected from the group consisting of Ca 2 \ 
Na + and K\ 



62. A method as in claim 57 wherein said change comprises a change in 
occurrence or rate of apoptosis or cell death. 

63. A method as in claim 57 wherein said change comprises a change in 
production of Ap peptides. 
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64. A method as in claim 57 wherein said change comprises a change in 

phosphorylation of at least one microtubule associated protein. 



65. A method as in claim 57 wherein said cell is a cell cultured in vitro . 

5 

66. A method as in claim 65 wherein said cell is a transformed host cell of any 
one of claims 30-34. 



67. A method as in claim 65 wherein said cell is explanted from a host bearing 

10 at least one mutant presenilin-interacting protein gene. 



68. A method as in claim 65 wherein said cell is explanted from a transgenic 

animal of any one of claims 35-39. 



A method as in claim 57 wherein said cell is a cell in a live animal. 



70. A method as in claim 69 wherein said cell is a cell of a transgenic animal 

of any one of claims 35-39. 



71 . A method as in claim 57 wherein said cell is in a human subject in a 

clinical trial. 
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72. A diagnostic method for determining if a subject bears a mutant presenilin- 
interacting protein gene comprising the steps of 

providing a biological sample of said subject; 
5 detecting in said sample a mutant presenilin-interacting protein nucleic 

acid, a mutant presenilin-interacting protein, or a mutant presenilin-interacting protein 
activity. 

73. A method as in claim 72, wherein a mutant presenilin-interacting protein 
10 nucleic acid is detected by an assay selected from the group consisting of direct 

nucleotide sequencing, probe specific hybridization, restriction enzyme digest and 
mapping, PCR mapping, ligase-mediated PCR detection, RNase protection, 
electrophoretic mobility shift detection, and chemical mismatch cleavage, 

15 74. A method as in claim 72, wherein a mutant presenilin-interacting protein is 

detected by an assay selected from the group consisting of an immunoassay, a 
protease assay, and an electrophoretic mobility assay. 

75. A pharmaceutical preparation comprising a substantially pure presenilin- 
20 interacting protein and a pharmaceutically acceptable carrier. 

76. A pharmaceutical preparation comprising an expression vector operably 
encoding a presenilin-interacting protein, wherein said expression vector may express 
said presenilin-interacting protein in a human subject, and a pharmaceutically 

25 acceptable carrier. 
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77. A pharmaceutical preparation comprising an expression vector operably 

encoding a presenilin-interacting protein antisense sequence, wherein said expression 
vector may express said presenilin-interacting protein antisense sequence in a human 
subject, and a pharmaceutically acceptable earner. 



78 . A pharmaceutical preparation comprising a substantially pure antibody, 

wherein said antibody selectively binds to a mutant presenilin-interacting protein, and 
a pharmaceutically acceptable carrier. 



79. A pharmaceutical preparation as in claim 78 wherein said preparation is 

essentially free of an antibody which selectively binds a normal presenilin-interacting 
protein. 



15 80. A pharmaceutical preparation comprising a substantially pure preparation 

of an antigenic determinant of a mutant presenilin-interacting protein. 



81. A pharmaceutical preparation as in claim 80 wherein said preparation is 

essentially free of an antigenic determinant of a normal presenilin-interacting protein. 



20 



82. A method of treatment for a patient bearing a mutant presenilin-interacting 

protein gene comprising the step of administering to said patient a therapeutically 
effective amount of the pharmaceutical preparation of any one of claims 75-81. 
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83. A method as in claim 82, wherein said pharmaceutical preparation is 

targeted to a cell type is selected from the group consisting of heart, brain, lung, liver, 
skeletal muscle, kidney, pancreas and neurological cells. 
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